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(54) Tide: CELLULAR IMMUNOGENS USEFUL AS CANCER VACCINES 



(57) Abstract 

A cellular immunogen is provided for immunizing a host against the effects of the product of a target proto-oncogene, where the 
overexpression of the target proto-oncogene is associated with a malignancy. The cellular immunogen comprises host cells which have 
been transacted with at least one transgene construct comprising a transgene cognate to the target proto-oncogene and a strong promoter to 
drive the expression of the transgene in the transfected cells. The transgene encodes a gene product which induces host immunoreactivity 
to host self-determinants of the product of the target proto-oncogene gene. The transgene may comprise, for example, wild-type or mutant 
retroviral oncogene DNA cognate to the target proto-oncogene; or wild-type or mutant proto-oncogene DNA of a species different from 
the host species. The cellular immunogen may be prepared from biopsied host cells, e.g. skin fibroblasts, which are stably or transiently 
transfected with the transgene construct containing the cognate transgene. The host celts transfected with the cognate transgene construct, 
are then returned to the body of the host to obtain expression of the cognate transgene in the host. 
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CELLULAR IMMUNOGENS USEFUL AS CANCER VACCINES" 

Cross-Reference to Related Application 

Priority from U.S. provisional patent application No. 60/010,262, 
filed January 19, 1996 is claimed. 

Field of the Invention 

The invention relates to the field of cancer vaccination and 
immunotherapy. 

Background of the Invention 

A current goal of cancer research is the identification of host 
factors that either predispose to tumor formation or serve to enhance tumor 
growth. 

Genes that confer the ability to convert cells to a tumorigenic 
state are known as oncogenes. The transforming ability of a number of 
retroviruses has been localized in individual viral oncogenes (generally v-onc). 
Cellular oncogenes (generally c-onc) present in many species are related to viral 
oncogenes. It is generally believed that retroviral oncogenes may represent 
escaped and/or partially metamorphosed cellular genes that are incorporated into 
the genomes of transmissible, infectious agents, the retroviruses. 

Some c-onc genes intrinsically lack oncogenic properties, but may 
be converted by mutation into oncogenes whose transforming activity reflects 
the acquisition of new properties, or loss of old properties. Amino acid 
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substitution can convert a cellular proto-oncogene into an oncogene. For 
example, each of the members of the c~ras proto-oncogene family (H-ras, N-ras 
and K-ras) can give rise to a transforming oncogene by a single base mutation. 

Other c-onc genes may be functionally indistinguishable from the 

5 corresponding v-onc, but are oncogenic because they are expressed in much 
greater amounts or in inappropriate cell types. These oncogenes are activated 
by events that change their expression, but which leave their coding sequence 
unaltered. The best characterized example of this type of proto-oncogene is c- 
myc. Changes in MYC protein sequence do not appear to be essential for 

10 oncogenicity. Overexpression or altered regulation is responsible for the 
oncogenic phenotype. Activation of c-myc appears to stem from insertion of a 
retroviral genome within or near the c-myc gene, or translocation to a new 
environment. A common feature in the translocated loci is an increase in the 
level of c-myc expression. 

15 Gene amplification provides another mechanism by which 

oncogene expression may be increased. Many tumor cell lines have visible 
regions of chromosomal amplification. For example, a 20-fold c-myc 
amplification has been observed in certain human leukemia and lung carcinoma 
lines. The related oncogene N-myc is five to one thousand fold amplified in 

20 human neuroblastoma and retinoblastoma. In human acute myeloid leukemia 
and colon carcinoma lines, the proto-oncogene c-myb is amplified five to ten 
fold. While established cell lines are prone to amplify genes, the presence of 
known oncogenes in the amplified regions, and the consistent amplification of 
particular oncogenes in many independent tumors of the same type, strengthens 

25 the correlation between increased expression and tumor growth. 

Immunity has been successfully induced against tumor formation 
by inoculation with DNA constructs containing v-onc genes, or by inoculation 
with v-a/ic proteins or peptides. A series of reports describe a form of 
"homologous" challenge in which an animal test subject is inoculated with either 

30 v-src oncoprotein or DNA constructs containing the v-src gene. Protective 
immunity was induced against tumor formation by subsequent challenge with v- 
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src DNA or v-5rc-induced tumor cells. See, Kuzumaki et aL, JNC1 (1988), 
80:959-962; Wisner et a/., 7. Virol (1991), 65:7020-7024; Halpern et ai, 
Virology (1993), 197:480-484: Taylor et a/., Virology (1994), 205:569-573; 
Plachy et a/., Immunogenetics (1994), 40:257-265. A challenge is said to be 
5 "homologous" where reactivity to the product of a targeted gene is induced by 
immunization with the same gene, the corresponding gene product thereof, or 
fragment of the gene product. A challenge is "heterologous" where reactivity 
to the product of a targeted gene is induced by immunization with a different 
gene, gene product or fragment thereof. 
!0 WO 92/14756 (1992) describes synthetic peptides and oncoprotein 

fragments which are capable of eliciting T cellular immunity, for use in cancer 
vaccines. The peptides and fragments have a point mutation or translocation as 
compared to the corresponding fragment of the proto-oncogene. The aim is to 
induce immunoreactivity against the mutated proto-oncogene, not the wild-type 
15 proto-oncogene. WO 92/14756 thus relates to a form of homologous challenge. 

EP 119.702 (1984) describes synthetic peptides having an amino 
acid sequence corresponding to a determinant of an oncoprotein encoded by an 
oncogenic virus, which determinant is vicinal to an active site of the 
oncoprotein. The active site is a region of the oncoprotein required for 
20 oncoprotein function, e.g., catalysis of phosphorylation. The peptides may be 
used to immunize hosts to elicit antibodies to the oncoprotein active site. EP 
119,702 is thus directed to a form of homologous challenge. 

The protein product encoded by a proto-oncogene constitutes a 
self antigen and, depending on the pattern of its endogenous expression, would 
25 be tolerogenic at the level of T cell recognition of the self peptides of this 
product. Thus, vaccination against cancers which derive from proto-oncogene 
overexpression is problematic. 

Recent attempts have been made to induce immunity in vitro or 
in vivo to the product of the HER-2//a?H proto-oncogene. The proto-oncogene 
30 encodes a 185-kDa transmembrane protein. The HER-2/new proto-oncogene is 
overexpressed in certain cancers, most notably breast cancer. In each report 
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discussed below, the immunogen selected to induce immunity comprised a 
purified peptide of the pl85 HER " 2Mw protein, and not a cellular immunogen. 

Disis et a/., Cancer Res. (1994) 54:16-20 identified several 
breast cancer patients with antibody immunity and CD4+ helper/ inducer T-cell 
5 immunity responses to pl85 HER2/ '"" protein. Antibodies to pl85 HER * 2/rt ^ were 
identified in eleven of twenty premenopausal breast cancer patients. It was 
assumed prior to this work that patients would be immunologically tolerant to 
HER-2y neu as a self -protein and that immunity would be difficult to generate. 

Disis et ai, Cancer Res. (1994) 54:1071-1076 constructed 

10 synthetic peptides identical to pl85 HER * 2/n " protein segments with amino acid 
motifs similar to the published motif for HLA-A2.1 -binding peptides. Out of 
four peptides synthesized, two were shown to elicit peptide-specific cytotoxic 
T-lymphocytes by primary in vitro immunization in a culture system using 
peripheral blood lymphocytes from a normal individual homozygous for HLA- 

15 A2. Thus, it was concluded that the pl85 HER " 2/w " proto-oncogene protein 
contains immunogenic epitopes capable of generating human CD8 + cytotoxic T- 
lymphocytes. 

The cytotoxic T cells elicited in the latter report were not, 
however, shown to recognize tumor cells, but only targets that bound the 

20 synthesized peptides. Other work (Dahl et a/., J. Immunol (1996), 157:239- 
246) has demonstrated that cytotoxic cells may recognize targets that bind 
peptide but fail to recognize targets that endogenously synthesize peptide. It is 
thus unclear whether the cytotoxic cells elicited by Disis et al. would be capable 
of recognizing tumor cells. In any event, no protection against tumor growth 

25 was demonstrated by Disis et ai 

Peoples et ai, Proc. Natl. Acad. Sci. USA (1995), 92:432-436, 
report the identification of antigenic peptides presented on the surface of ovarian 
and breast cancer cells by HLA class I molecules and recognized by tumor- 
specific cytotoxic T lymphocytes. Both HLA-A2-restricted breast and ovarian 

30 tumor-specific cytotoxic T lymphocytes recognized shared antigenic peptides. 
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T cells sensitized against a nine-amino acid sequence of one of the peptides 
demonstrated significant recognition of HLA-A2 HEK2/neu tumors. 

It remains unclear whether Peoples et al have successfully 
attacked proto-oncogene-encoded self, as the immunizing peptide which is 
5 expressed in the tumor cells contained an isoleucine at position 2, whereas the 
peptide expressed in normal tissue contains valine residue at this position. 
Moreover, although stimulation of T cells occurred in vitro, this stimulation 
does not represent a true primary immune response insofar as the starting T cell 
population represented tumor infiltrating lymphocytes. 

10 The research accounts of Disis et al. and Peoples ex al required 

a form of in vixro stimulation, either priming as described by Disis et al , or 
restimulation as described by Peoples ex al The in vixro protocols of Disis et 
al and Peoples et al require a mutant cell line to aid in selection of the peptide 
which will serve to induce reactivity. Non-mutant, peptide antigen-presenting 

15 cells have their HLA class I molecules already loaded with endogenous 
peptides, a phenomenon which precludes exogenous loading from without. The 
value of the mutant lines is that they lack the TAP genes (encoding the 
transporters associated with antigen presentation). Class I binding of internally- 
derived peptides is significantly lowered, and "empty" class I molecules are 

20 present on the cell surface and available for binding of exogenously added 
peptides. This availability of peptide binding sites on membrane-bound class 
I allows examination of whether a given peptide will (i) even bind to class I, 
and (ii) function as a target in cytotoxic T cell assays. However, the need for 
a mutant cell line for deduction of candidate immunizing peptide sequences 

25 limits the usefulness of peptide-based immunization schemes. 

Fendly et al, J. Biol Response Modifiers (1990), 9:449-455 
present an account of a polypeptide-based immunotherapy. Purified polypeptide 
corresponding to the extracellular domain of the pl85 HER2/w ^ protein was 
obtained from a transfected cell line. The purified peptide was employed in the 

30 immunization of guinea pigs. The immunized animals developed a cellular 
immune response, as monitored by delayed-type hypersensitivity. Antisera 
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derived from immunized animals specifically inhibited the in vitro growth of 
human breast tumor cells overexpressing pl85 HER - 2//,<r ". There is no indication 
by Fendly ex al of induction of self versus non-self reactivity. It is likely that 
the guinea pigs were chiefly responding to non-self determinants (as defined in 
5 terms of the guinea pig host) on the human polypeptide immunogen. 

The use of peptides for immunization is of necessity limited to 
immunization with a single haplotype. There are approximately thirty HLA 
types in man. In each case of peptide immunization, one must be careful to 
select peptides which match the host HLA type. The selected peptide must be 
10 immunogenic in the host and be capable of presentation to host immune system 
cells. 

What is needed is an immunization method for immunizing 
humans and animals against self-encoded proto-oncogenes which are associated 
with the development of cancer, which dispenses with the need for isolating 
15 immunogenic, HLA host-matched peptides for immunization. 

Summary of the Invention 

It is an object of the invention to induce reactivity to self- 
determinants of the product of an overexpressed proto-oncogene. 

It is an object of the invention to provide for a form of therapy 
20 or prophylaxis based upon the capacity to induce immune reactivity to proto- 
oncogene-encoded self as overexpressed in tumor cells. 

It is an object of the invention to provide a cellular immunogen 
for use in immunization against self proto-oncogene determinants. 

It is an object of the invention to provide for a method for 
25 vaccinating a host against disease associated with the overexpression of a proto- 
oncogene. 

These and other objects will be apparent from the following 

disclosure. 

A method of vaccinating a host against disease associated with the 
30 overexpression of a target proto-oncogene is provided. The method comprises: 
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(a) excising cells from the host; 

(b) transfecting the excised cells with at 
least one transgene construct comprising at least 
one transgene cognate to the target proto-oncogene 

5 and a strong promoter to drive the expression of 

the transgene in the transfected cells, the 
transgene encoding a gene product which induces 
host immunoreactivity to host self-determinants of 
the product of the target proto-oncogene gene; 
10 . (c) returning the excised cells transfected 

with the transgene construct to the body of the 
host to obtain expression of the transgene in the 
host. 

According to one principal embodiment of the invention, the 
15 transgene comprises wild-type or mutant retroviral oncogene DNA. According 
to another principal embodiment of the invention, the transgene comprises wild- 
type or mutant proto-oncogene DNA of a species different from the host 
species. Where the transgene comprises mutant retroviral oncogene DNA or 
mutant proto-oncogene DNA, the mutant DNA is preferably nontransforming. 
20 The mutant DNA preferably comprises a deletion mutation in a region of the 
DNA which is essential for transformation. Preferably, the host cells are 
transfected with a plurality, most preferably at least five, different transgene 
constructs, each construct encoding a different deletion mutation. 

In one preferred embodiment of the invention, the mutant DNA 
25 has at least about 75% homology, more preferably at least about 80% 
homology, most preferably at least about 90% homology, with the 
corresponding wild-type oncogene or proto-oncogene DNA. 

The invention is further directed to a cellular immunogen for 
immunizing a host against the effects of the product of a target proto-oncogene, 
the overexpression of which is associated with a cancer. The cellular 



30 
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immunogen comprises the host cells which have been transfected with at least 
one transgene construct, as described above. 

The invention is also directed to a method of preparing the 
cellular immunogen, by (a) excising cells from the host, and (b) transfecting the 
5 excised cells with at least one transgene construct, as described above. 

The cells transfected with the transgene are preferably rendered 
non-dividing prior to return to the body of the host. 

The term "corresponds to" is used herein to mean that a 
polynucleotide sequence is homologous (i.e., is identical, not strictly 
10 evolutionarily related) to all or a portion of a reference polynucleotide sequence, 
or that a polypeptide sequence is identical to a reference polypeptide sequence. 

The term "cognate" as used herein refers to a gene sequence that 
is evolutionarily and functionally related between species. For example but not 
limitation, in the human genome, the human c-myc gene is the cognate gene to 
15 the mouse c-myc gene, since the sequences and structures of these two genes 
indicate that they are highly homologous and both genes encode proteins which 
are functionally equivalent. 

By "homology" is meant the degree of sequence similarity 
between two different amino acid sequences, as that degree of sequence 
20 similarity is derived by the FASTA program of Pearson and Lipman, Proc, 
Natl. Acad. Sci. USA (1988), 85:2444-2448, the entire disclosure of which is 
incorporated herein by reference. 

As used herein, the term "operably linked" refers to a linkage of 
polynucleotide elements in a functional relationship. A nucleic acid is "operably 
25 linked" when it is placed into a functional relationship with another nucleic acid 
sequence. For instance, a promoter or enhancer is operably linked to a coding 
sequence if it affects the transcription of the coding sequence. Operably linked 
means that the DNA sequences being linked are typically contiguous and, where 
necessary to join two protein coding regions, contiguous and in reading frame. 
30 The word "transfection" is meant to have its ordinary meaning, 

that is, the introduction of foreign DNA into eukaryotic cells. 



WO 97/25860 



PCT/US97/00582 



- 9 - 

By "transgene" is meant a foreign gene that is introduced into one 
or more host cells. 

By "transgene construct" is meant DNA containing a transgene 
and additional regulatory DNA, such as promoter elements, necessary for the 
5 expression of the transgene in the host cells. 

Description of the Figures 

Fig. 1 is a plot of the mean tumor diameter over time following 
subcutaneous wing web inoculation of 1 -day -old line TK (panel A) and line SC 
(panel B) chickens with 100 /xg of tumorigenic plasmids pc.yrc527 (—a—), 

10 pVSRC-Cl (— ) or pMwsrc ( — ■ --). The mean tumor diameter (mm) at 
a particular time point and for any one group of TK or SC line chickens 
inoculated was computed as the sum of the diameters of the primary tumors 
divided by the number of chickens surviving to that point. The ratios at each 
time point show, for a particular group, the number of chickens bearing 

15 palpable tumors to the total number of survivors to that point (standard typeface 
for pcjrc527, italics for pVSRC-Cl, bold typeface for pMVjrc). Error bars 
(unless obscured by the symbol) indicate standard error. 

Fig. 2 is a plot of the growth of challenge (wing web) tumors in 
test and control line TK chickens under conditions of (i) priming and 

20 homologous challenge with plasmid pcj/r527 (panel A: — A — , test; — a — , 

control), or (ii) priming and homologous challenge with plasmid pVSRC-Cl 
(panel B: — O — , test; — • — , control). Test chickens were primed at 1 day 
posthatch with 100 fxg of construct; test and control chickens were challenged 
at five weeks posthatch with 200 fig of construct. The mean challenge diameter 

25 was computed as in Fig. 1. At each time point the ratio of chickens bearing 
palpable challenge tumors to total number of survivors to that point is indicated 
(standard typeface for control group, bold typeface for test group). The 
statistical comparison between the mean challenge tumor diameters of the test 
versus the control group at a particular time point was made using a two-tailed 

30 student's t test, *(p<0.05), **(p<0.01), ***(p<0.001). The statistical 
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comparison between the ratios of chickens bearing palpable challenge tumors to 
total number of survivors of the test versus the control group at a particular 
time point was made using a chi-squared test; the paired ratios are underlined 
for only those time points where p<0.05. Error bars indicate standard error. 
5 Fig. 3 is a plot of the growth of challenge (wing web) tumors in 

TK chickens under conditions of (i) priming with plasmid pVSRC-Cl and 
heterologous challenge with plasmid pcsrc527 (panel A: — a — , test; — a — , 
control) or (ii) priming with pcsrc521 and heterologous challenge with pVSRC- 
Cl (panel B: — O — , test; — control). Test chickens were primed at 1 

10 day posthatch with 100 /xg of construct; test and control chickens were 
challenged at five weeks posthatch with 200 jig of construct. The mean 
challenge tumor diameter was computed as in Fig. 1. At each time point the 
ratio of chickens bearing palpable challenge tumors to total number of survivors 
to that point is indicated (standard typeface for control group, bold typeface for 

15 test group). Statistical comparisons were made between test and control groups 
at a particular time point as described for Fig. 2. [*(p<0.05), **(p<0.01), 
***(p< 0.001), for the student's t test], and the paired ratios are underlined for 
only those time points where, in the chi-squared test, p<0.05. Error bars 
indicate standard error. 



20 Detailed Description of the Invention 

A vaccination strategy is provided to prevent development of 
cancers. The vaccination method may be carried out on a subject at risk for a 
particular cancer, but before the development of the cancer. The practice of the 
invention may serve for the immunopre vent ion of prevalent human cancers, 

25 such as colon carcinoma, breast carcinoma, and various lymphomas whose 
progress is accompanied by the overexpression of a cellular proto-oncogene. 

The vaccination strategy of the present invention relies on the 
induction of an immune response that targets tumor cells by virtue of the 
recognition of the proto-oncogene-specific antigenicity. The aim of the vaccine 

30 protocol is to induce reactivity to self-determinants of an overexpressed proto- 
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oncogene product. The strategy exploits the structural relatedness between the 
product of the cellular proto-oncogene and that of the product of genes cognate 
to the target proto-oncogene. The cognate gene may comprise a wild-type or 
mutant cognate retroviral oncogene or a wild-type or mutant proto-oncogene 
5 of a species different from the host species. The starting point of the vaccine 
strategy is the high degree of primary sequence homology that exists between 
the protein product of a targeted proto-oncogene and that of its cognate 
retroviral oncogene, or between the proto-oncogene product and the product of 
a cognate proto-oncogene from a different species. However, in contrast to 
10 other proposed vaccine strategies, the present invention is not based on the 
immune recognition of a determinant defined by a cancer specific mutation. 

For those tumors showing proto-oncogene overexpression, this 
sequence homology permits application of the following strategy, which can be 
employed either prophylactically or therapeutically under conditions of cell- 
15 surface expression, or other forms of adjuvanicity, as chosen to enhance 
immunogenicity: (a) immunization of host biopsied cells with a DNA construct 
comprising a transgene cognate to the target proto-oncogene, which transgene 
encodes a gene product which induces host immunoreactivity to host self- 
determinants of the product of the target proto-oncogene; (b) return of the 
20 transfected cells to the body of the host to obtain expression of the transgene in 
the host, and thus immunity against the proto-oncogene product. The invention 
relies on the targeting of a self-determinant found on an overexpressed or 
overabundant proto-oncogene-encoded product. The foreign peptide elements 
of the immunizing oncogene product will trigger peripheral lymphocytes 
25 exhibiting a weak cross reactivity for the self peptides of the targeted proto- 
oncogene product. Although such self peptides would be present in normal cells 
expressing the proto-oncogene, targeting of the tumor cells is favored in view 
of their overexpression of the proto-oncogene. 

The immune strategy exploits the antigenicity of two alternative 
30 types of determinants: (1) tumor-associated antigenic determinant(s) induced as 
a consequence of the activity of the oncogene product, e.g., an enzymatic 
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modification of a cellular protein effected by the oncogene product, or (2) tumor 
associated antigenic determinant(s) intrinsic to the oncogene-encoded product 
itself. The difficulty in exploiting the first alternative by traditional means, i.e., 
antigen purification, is that at present little or no systematic information exists 

5 bearing on the properties of an antigen that, though oncogene-induced, is not 
oncogene-encoded. This situation makes purification of any such antigen 
problematic. However, this problem is obviated from the outset by the present 
invention which utilizes biopsied cells which, as transfected in culture by the 
cognate retroviral oncogene, would express the relevant antigenicity. 

10 In terms of exploiting the second alternative, that of an 

antigenicity intrinsic to the proto-oncogene product, a relevant consideration is 
that the protocol of immunization according to the present invention primes the 
host to determinants of the oncogene product itself. A consequence of this 
immunization is induction of T-cell reactivity to the divergent, i.e foreign, 

15 peptide determinants of the retroviral oncogene product, i.e., those peptide 
determinants that show sequence differences with the positionally homologous 
determinants of the cellular proto-oncogene product. The induction of this 
reactivity does not in itself have vaccine potential, since the foreign 
determinants specific to the retroviral oncogene product are normally absent 

20 from the cellular proto-oncogene product. Nevertheless, the foreign peptide 
elements, notably those that differ by only a single amino acid from the 
positionally homologous self peptides, trigger peripheral T-lymphocytes 
exhibiting a weak cross-reactivity for the self peptides. Although such self 
peptides are present in normal cells expressing the proto-oncogene, targeting of 

25 the tumor cells is favored in view of their overexpression of the proto-oncogene. 

It is possible that many tumor-associated and overexpressed 
proto-oncogenes might possess mutations. In some cases, overexpression may 
very well arise as a direct consequence of one or more of the mutations. 
However, the present vaccination method does not have as its object the 

30 deliberate targeting of non-self determinants generated by proto-oncogene 
mutations. Unlike prior vaccination methods designed to target such mutation- 
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driven non-self determinants, it is the aim of the present invention to induce 
reactivity for self-determinants in the overexpressed product of tumor associated 
and overexpressed proto-oncogenes. 

Prior efforts attempting to elicit reactivity to proto-oncogene self 
5 determinants have relied on in vitro protocols utilizing mutant cell lines to 
identify individual self peptide immunogens (Disis et aL % Cancer Res. (1994) 
54:1071-1076; Peoples et al. , Proc. Natl. Acad. Sci USA (1995), 92:432-436). 
According to the present invention, the host immune system is presented with 
the full array of naturally-derived class I binding peptides. The vaccine strategy 
10 of the present invention obviates the need for any a priori assessment of the 
immunogenicity of individual peptides. 

While the cellular immunogens of the invention display self 
peptides, non-self peptides would also be presented which may serve as more 
effective tolerance breakers. The value of a non-self, but closely related to self, 
15 peptide is that it may more readily activate those T cells that have both a weak 
cross reactivity for the cognate self peptide and an activation threshold 
(determined by the tightness of binding to the T cell receptor) too high to be 
triggered by the self peptide. Moreover, cognate non-self is inductive of a good 
immune response, simply because it does in fact constitute nonself. The non- 
20 self immune response is expected to predispose the induction of the inevitably 
weaker response to the self determinants on the same protein product, since the 
resultant cytokine release provides local help to initiate the weaker anti-self 
response. 

As hereinafter exemplified in a model of srooncogene-based 
25 tumor formation, immunization with cells transfected with a transgene construct 
expressing the v-src oncogene product induces reactivity to the product of the 
c-src proto-oncogene, thereby conferring protection against the growth of 
tumors displaying overexpression of the c-src proto-oncogene. 
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Target Proto-Onco|genes 

According to the present invention, patients with a family history 
of a cancer characterized by the overexpression of a particular proto-oncogene 
are selected for immunization. Alternatively, patients whose tumors can be 
5 shown to overexpress the proto-oncogene are selected. Overexpression of a 
proto-oncogene may derive from an increase over a basal level of transcription. 
Overexpression may also derive from gene amplification, that is, an increase in 
gene copy number, coupled with a basal or elevated level of transcription. 
Proto-oncogene overexpression may be assayed by conventional probing tech- 

10 niques, such as described in Molecular Cloning: A Laboratory Manual J. 
Sambrook el al y eds., Cold Spring Harbor Laboratory Press, 2nd ed. 1989. 
The level of target proto-oncogene expression may be determined by probing 
total cellular RNA from patient cells with a complementary probe for the rele- 
vant mRNA. Total RNA from the patient cells is fractionated in a glyox- 

15 al/agarose gel, transferred to nylon and hybridized to an appropriately labelled 
nucleic acid probe for the target mRNA. The number of relevant mRNA tran- 
scripts found in the patient cells is compared to that found in cells taken from 
the same tissue of a normal control subject. 

As an alternative to measuring mRNA transcripts, the expression 

20 level of a target proto-oncogene may be assessed by assaying the amount of 
encoded protein which is formed. Western blotting is a standard protocol in 
routine use for the determination of protein levels. See Molecular Cloning, 
supra, Chapter 18, incorporated herein by reference. Accordingly, a cell lysate 
or other cell fraction containing protein is electrophoresed on a polyacrylamide 

25 gel, followed by protein transfer to nitrocellulose, and probing of the gel with 
an antibody specific for the protein in question. The probe step permits 
resolution of the desired protein from all other proteins in the starting mixture. 
The bound antibody may be prelabeled, e.g., by a radioisotope such as I25 I, so 
as to permit its detection on the gel. Alternatively, a secondary reagent (usually 

30 an anti-immunoglobin or protein A) may be radiolabeled or covalently coupled 
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to an enzyme such as horseradish peroxidase or alkaline phosphatase. The 
strength of the signal is proportional to the amount of the target protein. The 
strength of the signal is compared with the signal from a sample analyzed in the 
same manner, but taken from normal as opposed to tumor tissue. 
5 A description of the methodology and use of Western blotting to 

determine the levels of the c-s/r-encoded protein pp60 c jrc in adenomatous polyps 
(colonic epithelia) is provided by Cartwright et al. , Proc. NatL Acad. Sci. USA 
(1990), 87:558-562, the entire disclosure of which is incorporated herein by 
reference. 

10 An at least about eight-fold increase in that gene's expression in 

the patient cells compared to expression in normal control cells from the same 
tissue would indicate candidacy for vaccination. 

Table 1 includes a partial list of representative proto-oncogenes, 
the overexpression of which has been associated with one or more malignancies. 

15 Each listed proto-oncogene is a target proto-oncogene according to the present 
invention. The corresponding oncogene, of which the target proto-oncogene is 
the normal cellular homolog, is also identified. This list of target proto- 
oncogenes is intended to be representative, and not a complete list. 

Table 1 

20 Representative List of Target Proto-Oncogenes 

Proto- 

Oncogene Tumor Comments/References 

AKT-2 ovarian v-Akt is the oncogene of the AKT8 virus, which 

induces lymphomas in mice. 
25 1. Bellacosa et a/., (1995) Int. J. Cancer 

64(4):280-5: Southern-blot analysis has shown 
AKT-2 amplification in 12.1% of ovarian 
carcinomas, while Northern bot analysis has 
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revealed overexpression of AKT-2 in 3 of 25 
fresh ovarian carcinomas which were negative for 
AKT-2 amplification. 

2. Cheng et al. y (1996) Proc. Natl. Acad. Sci. 
5 USA 89(19): 9267-71): Amplification of AKT-2 

has been detected in 10% of pancreatic 
carcinomas. 



AKT-2 pancreatic Cheng et aL % (1996) Proc. Natl. Acad. Sci. USA 

93(8):3636-41 : Amplification of AKT-2 has been 
detected in 10% of pancreatic carcinomas. 

c-erbB-2 bladder c-ErbB-2 is also known as HER2/neu. V-erbB is 

the oncogene of the avian erythroblastosis virus. 

1. Underwood et al y (1995) Cancer Res. 
55(ll):2422-30: Protein overexpression was 
observed in 45% of patients with non- recurrent 
disease and 50% of patients with recurrent 
disease; 9% of bladder tumors analyzed shoed 
gene amplification. 

2. Coombs et al, (1993) Pathology 169(1):35- 
42: c-ErbB-2 gene amplification was observed in 
14% of bladder tumors analyzed. 

3. Gardiner et al. , (1992) Urolog. Res. 20(2): 17- 
20: Nineteen percent of primary transitional cell 
bladder carcinomas showed c-erbB-2 gene 
amplification. 



c-erbB-2 



breast 



1. Molina et aL, (1966) Anticancer Research 
16(4B): 2295-300: Abnormal c-erbB-2 levels were 
found in 9.2% of patients with locoregional breast 
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carcinoma, and in 45.4% of patients with 
advanced disease. 2. DePotter et aL, (1995) 
Virchows Arch. 426(2) : 1 07- 1 5 : O verexpression of 
the oncoprotein . is observed in about 20% of 
invasive duct cell carcinomas of the breast. 3. 
Bandyopadhyay et aL, (1994) Acta Oncol. 
33(5):493-8: 35.4% of breast tumors showed c- 
erbB-2 overexpression; 17.4% showed gene 
amplification. 4. Fontana et aL, (1994) 
Anticancer Res. 14(5B): 2099- 104: 26% of 
samples showed c-erbB-2 amplification. 5. Press 
et aL, (1993) Cancer Research 53(20):4960-70: 
Amplified overexpression was identified in 38% 
of primary breast cancers. 6. Berns et aL, 
(1992) Cancer Res. 52(5): 1107-13: 23% of 
primary breast cancer tissues exhibited 
amplification. 7. Delvenne et aL , (1992) Eur. J. 
of Cancer 28(2-3): 700-5: c-erbB-2 mRNA was 
overexpressed in 34% of breast tumor samples. 
8. Inglehart, (1990) Cancer Res. 50(20): 670 1-7: 
Two to thirty-two-fold gene amplification was 
found in multiple stages of tumor progression. 9. 
Slamon et aL, (1989) Science 244:707-12: A 
28% incidence of amplification of c-erbB-2 was 
found in 189 primary breast cancers. 10. Kraus 
et aL, (1987) EMBO J. 6(3):605-10: Eight cell 
lines demonstrated c-e/*B-2 mRNA levels ranging 
from 4 to 128-fold overexpression. 60% of all 
tumors analyzed showed elevated levels of c-erbB- 
2 mRNA. 
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1. Osaki et al., (1995) Chest 108(1): 157-62: 
Lung tissue overexpression of c-erbB-2 was 
discovered in 42.5% of samples. 2. Lorenz et 
al, (1994) Clin. Invest. 72(2):156-63: A 64-fold 
increase in the amount of c-erbB-2 mRNA was 
observed; 33% of lung tumors showed 
overexpression of c-erbB-2. 



c-erbB-2 



ovarian 



10 



15 



20 



25 



1. Katsaros et al, (1995) Anticancer Res. 
15(4): 1501-10: Abnormally high expression of c- 
erbB-2 was found in 31% of tumor samples. 2. 
Felipe/., (1995) Cancer 75(8):2147-52: 21.7% 
of ovarian tumors showed overexpression of c- 
erbB-2. 3. Fan et ai, (1994) Chin. Med. J. 
107(8): 589-93: c-erbB-2 amplification was found 
in 30.8% (8 of 26) of human ovarian cancers. 4. 
vanDam et ai, (1994) J. of Clin. Path. 
47(10):914-9 : 24% of ovarian tumors showed c- 
erbB-2 overexpression. 5. Csokay el ai, (1993) 
Eur. J. of Surg. Oncology 19(6):593-9: c-erbB-2 
amplification was found in 34% of fresh ovarian 
tumor samples. 6. McKenzie et al. y (1993) 
Cancer 7 1(1 2): 3942-5: 30% of ovarian tumor 
samples indicated c-erbB-2 overexpression. 7. 
Hung et aL y (1992) Cancer Letters 61(2):95-103: 
A 100-fold c-erbB-2 overexpression was 
discovered in one human cell line. Two to four- 
fold amplification was also discovered. 



MDM-2 leukemia MDM-2 is the murine double minute-2 oncogene. 

1 . Bueso-Ramos et al. , (1993) Blood 82(9):2617- 
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23: 53% of cases showed overexpression of 
MDM-2 mRNA. The level of MDM-2 mRNA 
overexpression in some cases of leukemias was 
comparable to that observed in some sarcomas, 
which demonstrate more than 50-fold MDM-2 
gene amplification. No evidence of gene 
amplification was observed. 2. Watanabe et aL , 
(1994) Blood 84(9): 3 158-65: 28% of patients 
with B-cell chronic lymphocytic leukemia or non- 
Hodgkin's lymphoma had 10-fold higher levels of 
MDM-2 gene expression. MDM-2 overexpression 
was found more frequently in patients at advanced 
clinical stages. 

\-myb is the oncogene of the avian 
myeloblastoma virus. 1. Ramsay et aL, (1992) 
Cell Growth and Diff. 3(10):723-30: z-myb levels 
were always higher in colon cancer samples than 
normal tissue. 2. Alitalo et al s (1984) Proc. 
Natl Acad. ScL 81(14):4534-8: c-myb levels 
were always higher in colon cancer samples than 
normal tissue. 

V-myc is the oncogene of the avian myelocytoma 
virus. 1. Lonn et aL, (1995) Cancer 
75(1 1):2681-7: Amplification of z-myb occurs in 
16% of patients with breast cancer. 2. Hehir et 
aL, (1993)7. of Surg. Oncology 54(4): 207-9: c- 
myc overexpression was found in 60% of breast 
carcinoma samples. 3. Kreipe et aL, (1993) 
Cancer Research 53(8): 1956-61 : Amplification of 
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10 



c-myc was found in 52.6% of samples that 
displayed a Ki-Sl labelling index exceeding 30%. 
4. Watson et al , (1993) J. Nat. Cancer Inst. 
85(ll):902-7: Amplification of c-myc occurs in 
up to 20 - 30% of breast cancers. 5. Berns ex 
al, (1992) Cancer Research 52(5): 1107-13: 
Amplification was found in 20% of primary breast 
cancer patients; the range was 3-14 gene copies. 
6. Watanabe et al, (1992) Cancer Research 
52(19):5 178-82: Expression of c-myc was 
increased by 10- fold. 



c-myc 



gastric/ 
colorectal 



15 



20 



25 



1. Rigas, (1990) Clin. Gastroent. 12(5):494-9: 
Overexpression of c-myc is found in 80 of colon 
cancers. 2. Erisman et al., (1988) Oncogene 
2(4): 367-78: Adenocarcinoma cell lines express 
5-10-fold elevated levels of c-myc mRNA. Eight 
to thirty-seven-fold higher levels of c-myc protein 
was found in tumor cell lines compared to normal 
cells. 3. Sikora et al., (1987) Cancer 
59(7): 1289-95: Up to 32-fold overexpression of 
c-myc mRNA was observed in 12 to 15 tumors. 
4. Tsuboi et al, (1987) Biochem. and Biophys. 
Res. Comm. 146(2) :705- 10: Gastric Cancer: A 
2-3-fold overexpression was observed in gastric 
cancer. A 2-10-fold overexpression was observed 
in colorectal cancer. 



c-myc lung 1. Lorenz et al, (1994) Clin. Invest. 72(2): 156- 

63: A 57-fold increase in c-myc mRNA levels 
was observed. 23% of samples indicated strong 
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expression of c-mvc. 2. Kato et al. , (1993) Jap. 
J. of Cancer Res. 84(4):355-9: Liver tissue 
metastases from human small cell lung carcinoma 
revealed 30-fold amplification of c-myc. 



c-m\c 



naso- 

pharn- 

geal 



Porter et al, (1994) Acta Oto-Laryng. 114(1): 
1105-9: 22% of samples showed intense staining 
for c-myc. 



c-mxc 



ovarian 



10 



15 



20 



1. Bian et al, (1995) Chin. /. of Ob. Gyn. 
30(7):406-9: 50% of samples showed 
amplification of c-myc. 2. Katsaros et al, 
(1995) Anticancer Res. 15(4):150M0: 26% of 
samples exhibited c-myc amplification. 3. van 
Dam et al, (1994) J. Clin. Path. 47(10):914-9: 
Overexpression of c-myc was found in 35% of 
ovarian carcinomas. 4. Xin et al. f (1993) Chin. 
J. of Ob. Gyn. 28(7):405-7: 54.5% of samples 
showed amplification of c-myc. 5. Tashiro et al. , 
(1992) Int. J. of Cancer 50(5): 828-33: 
Overexpression was found in 63.5% of all serous 
adenocarcinoma tissues and 37.3% of all ovarian 
carcinoma tissues. Significant overexpression of 
c-myc was observed at Stage III compared with 
other stages. 



c-mvc prostate Nag etai, (1989) Prostate 15(2): 115-22: A 10- 

25 fold amplification of c-myc was observed. Fifty- 

fold higher levels of mRNA transcripts of c-myc 
were found. 
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lung 
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Ras oncogenes were first recognized as the 
transforming genes of Harvey and Kirsten murine 
sarcoma viruses. Lorenz et aL, (1994) Clin. 
Invest. 72(2): 156-63: a 13-fold increase in 
overexpression of c-Ki-ras was observed. 18% of 
tumors displayed strong overexpression of c-Ki- 
ras. 



c-ras 



ovarian 



10 



15 



1. Katsaros et al, (1995) Anticancer Res. 
15(4): 1501-10: Higher levels of ras protein than 
in normal or benign ovarian tumors were found in 
45% of tumor samples. 2. vanDam et a/., 
(1994) J. of Clin. Path. 47(10):914-9: 20% of 
ovarian tumors exhibited c-ras overexpression. 
The levels of expression of c-ras were much 
higher in tumors of patients with recurrent or 
persistent disease after chemotherapy, than in the 
tumors of patients at initial presentation. 



c-src 



20 



breast 



V~src is the oncogene of the Rous sarcoma virus, 
which induces sarcomas in chickens. 
Muthuswamy et a/. , (1 994) Mol. and Cell. Biol 
14(1): 735-43 : c-e/*B-2-induced mammary tumors 
possessed 6-8-fold higher c-src kinase activity than 
adjacent epithelium. 



c-src 



25 



colon/ 
colorectal 



1. Cartwright et al., (1994) J. of Clin. Invest. 
93(2):509-15: c-src activity is 6-10-fold higher in 
mildly dysplastic ulcerative colitis (a chromic 
inflammatory disease of the colon with a high on 
incidence of colon cancer) than in non-dysplastic 
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epithelia. This data suggests that activation of c- 
src is an early event in the genesis of UC colon 
cancer. 2. Talamonti et aL, (1993) J. of Clin. 
Invest. 91(l):53-60: High level of c-src activity 
from colorectal cancer is found in liver 
metastases. 3. Termuhlen et al y (1993) J. of 
Surg. Res. 54(4): 293-8: Colon carcinoma 
metastases to the liver had significantly increased 
activity of c-src with an average 2.2-fold increase. 
Extrahepatic colorectal metastases demonstrated an 
average 12.7-fold increase in c-src activity over 
normal mucosa. 

V-yes is the oncogene of two avian sarcoma 
viruses, Esh sarcoma virus and Y73. 1. Pena et 
aL, (1995) Gastroent. 108(1): 117-24: Twelve to 
fourteen-fold higher expression of c-yes was found 
in colonic transforming oncogene adenomas 
compared to normal mucosa. Activity of c-yes 
was elevated in adenomas that are at greatest risk 
for developing cancer. 2. Park et aL, (1993) 
Oncogene 8(10):2627-35: A ten to 20-fold higher 
than normal activity of c-yes was observed in 3 
out of 5 colon carcinoma cell lines. A 5-fold 
higher than normal activity was found in 10 out of 
21 primary colon cancers, compared to normal 
colonic cells. 
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Selection of Cognate Transgene for Preparation of Cellular Immunogen 

According to the present invention, a transgene construct is 
engineered comprising a transgene which is cognate to the target proto-oncogene 
(hereinafter "cognate transgene" or "CTG"). The transgene is selected such that 
5 it encodes a gene product which induces host immunoreactivity to host self- 
determinants of the product of the target proto-oncogene. The transgene should 
be expressed to very high levels in the transfectants. Thus, the construct should 
contain a strong promoter. 

The product encoded by the cognate gene must have a high 

10 degree of sequence homology with the product of the target proto-oncogene, but 
also must display some amino acid differences with the target proto-oncogene 
product. Thus, there must be a subset of one or more amino acid differences 
between the target proto-oncogene and its cognate in order to provide 
immunogenic stimulus. Two classes of genes that satisfy these criteria are 

15 retroviral oncogenes and xenogenic proto-oncogenes. The word "xenogenic" 
is intended to have its normal biological meaning, that is, a property or 
characteristic referring or relating to a different species. Thus, a xenogenic 
proto-oncogene is meant to include the a homologous proto-oncogene of a 
species other than the host organism species. It may be appreciated that in the 

20 case of a target proto-oncogene, e.g. MDM2, for which no retroviral homolog 
is yet known, a xenogenic homologue is advantageously utilized as the source 
of the DNA for the cognate transgene. 

In principle, a more effective immunogenic stimulus would 
depend on the particular sequence, and not on the distinction between a 

25 retroviral oncogene and a xenogenic proto-oncogene in terms of their relative 
transforming capacity. Thus, in certain cases, a retroviral oncogene may be 
better at providing a tolerance-breaking immunogenic stimulus, and in other 
cases, a xenogenic proto-oncogene may be more effective. 

The retroviral oncogene or xenogenic proto-oncogene DNA 

30 forming the CTG may comprise the wild type oncogene or proto-oncogene 
DNA. More preferably, a mutant DNA is utilized, which is engineered so as 
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to be non-transforming in the host. The DNA is mutated to include one or 
more nucleotide insertions, deletions or substitutions which will encode an 
oncogene product which is nontransforming in the host, but retains the requisite 
degree of sequence homology with respect to the target proto-oncogene. A 
5 cognate transgene deletion mutant (hereinafter "dCTG") is preferred. 

A protein sequence is generally considered "cognate" with respect 
to the target proto-oncogene-encoded protein if it is evolutionarily and 
functionally related between species. A more precise view of cognation is based 
upon the following sequence comparison carried out utilizing the FASTA 
10 program of Pearson and Lipman, Proc. Natl. Acad. ScL USA (1988), 85:2444- 
2448, the entire disclosure of which is incorporated herein by reference. 
Cognation is attained upon satisfying two criteria imposed by FASTA; (i) 
alignment of segments corresponding to at least 75% of the target proto- 
oncogene's encoded amino acid sequence; (ii) at least 80% amino acid identity 
15 within the aligned sequences. The segments of the target proto-oncogene 
protein sequence and protein test sequence satisfying the two criteria are 
referred to as "homology regions". Accordingly, at least 75% of the target 
proto-oncogene protein sequence is alignable with the test sequence. The 
alignable segments or homology regions may, however, represent less than 75% 
10 of the total test polypeptide chain for the case of test sequences that may 
significantly exceed the target proto-oncogene protein in length. 

One skilled in the art, armed with the FASTA program, may 
survey existing sequence data bases (either protein sequences or DNA 
sequences, insofar as the amino acid sequence is determined by FASTA for all 
5 reading frames) for test sequences which are cognate with respect to the target 
proto-oncogene. At the same time, one can isolate and then sequence what are 
very likely to be cognate test sequences (e.g. feline MDM-2, as likely to be 
cognate to human MDM-2) and use FASTA to verify the presumed cognation, 
according to the criteria set above. One may obtain the sequences of 
0 presumptive cognate proto-oncogenes from a large number of mammalian 
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sequences and screen these sequences with FASTA according to the aforesaid 
formulation of cognation. 

Because the product encoded by a CTG differs at a small number 
of amino acid positions from the product encoded by the target proto-oncogene, 

5 an immunogenic stimulus is provided that (i) is directed against the foreign 
protein and (ii) with a lower probability, induce an anti-self response. The 
CTG is selected such that the gene product will yield the greatest immunogenic 
stimulus to induce anti-self reactivity. Provided that overall sequence homology 
(preferably greater than about 75%) is maintained, the presence of scattered 

10 amino acid differences is desired, since any one residue would likely have a 
relatively low probability of inducing self-reactivity. Moreover, the greatest 
number of residue differences would be advantageous, consistent with 
maintaining the requisite degree of general sequence homology. 

The selection of amino acid modifications for the CTG may be 

15 facilitated by resort to available computer-based models used to identify 
immunogenic peptide fragments of polypeptides. These models could be 
employed to select CTGs which would possess the maximum number of 
immunogenic peptides for a given HLA haplotype. 

Screening Procedure for CTG Selection 

20 Notwithstanding the availability of computer-based algorithms 

which have some predictive value, it is desirable to design CTGs with resort to 
a screening procedure based on an actual experimental assay that can be HLA- 
haplotype specific. Accordingly, cells are biopsied from a normal volunteer of 
particular haplotype. The cells are transfected with a CTG construct, preferably 

25 a dCTG construct, satisfying the criteria set for cognition. More preferably, the 
cells are transfected with multiple dCTGs, preferably at least five dCTGs, 
satisfying the criteria for cognition. The at least five dCTGs are selected to 
display amino acid differences that essentially extend throughout the polypeptide 
chains of the encoded sequences. The transfected cells are then used to 

30 immunize the volunteer in accordance with the immunization method of the 
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present invention. After immunization, the human subject is tested in a standard 
delayed hypersensitivity (DH) reaction with 10 4 -10 6 irradiated, autologous 
fibroblasts, as transfected with the same dCTG (or series of dCTGs) as used for 
the immunizing preparation. A positive DH reaction (induration) would verify 
5 the induction of reactivity. The induction of reactivity in this assay is readily 
demonstrable because of the priming to the non-self determinants on the dCTG- 
encoded protein and the readout in the DH reaction of the same nonself 
determinants. Once DH reactivity is demonstrated in a DH reaction that 
directly tests the antigenicity of the non-self determinants encoded by the dCTG 

10 (i.e., priming with a non-self construct, DH testing with the same non-self 
construct), the subject can be then tested in a DH reaction based on testing with 
the autologous cells transfected with a dCTG derived from the human proto- 
oncogene itself (i.e., priming with a non-self construct, testing with the human 
self construct). Testing of a battery of human volunteers will lead to a 

15 catalogue of HLA-matched dCTGs, such that, for individuals of the same HLA 
haplotype, the use of the particular dCTG would be inductive of reactivity to 
proto-oncogene-encoded self. Different CTGs may thus be tested so as to 
correlate maximal secondary stimulation with a particular HLA haplotype. 

At the same time, this procedure may be used with patients 

20 undergoing tumor resection (if post-operative immuno-suppressive protocols are 
not mandatory), such that prior to resection, a course of immunization would 
have been initiated, the endpoint of which would represent the development of 
a DH reaction. 

Any given amino acid difference between the CTG-encoded 
25 product and the proto-oncogene-encoded product has a low probability of being 
a "tolerance-breaker". Thus, it is preferable to transfect the host cells with a 
mixture of multiple different CTGs, preferably dCTGs. The number of 
different dCTGs is preferably five or more. Moreover, it is preferred that, 
among themselves, the multiple dCTGs show amino acid differences that 
30 essentially extend throughout the polypeptide chains of the encoded sequences. 
The dCTGs would be selected to maximize amino acid differences and, at the 
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same time, make sure that differences are found all along the polypeptide chain. 
It would thus not be preferable to select a battery of deletions all from within 
the same domain of the polypeptide chain. 

According to a protocol which utilizes 10 7 irradiated cells for 
5 immunization containing five separate dCTGs, five groups of 2 X 10 6 cells are 
included in one inoculate, each group of 2 X 10 6 having been transfected with 
a separate dCTG from the total set of five CTGs that are cognate to a particular 
proto-oncogene. 

Selection of Non-Transforming Cognate Transgenes 

10 Non-transforming cognate transgene variants are most 

advantageously derived via deletion of a sequence essential for transformation. 
Unlike point mutations which are potentially reversible due to back mutations, 
deletion mutations are irreversible. Furthermore, deletion mutations do not 
possess the inherent disadvantage attaching to point mutations, namely, even 

15 though the requirement for generation of an acceptable cognate transgene is for 
a qualitative difference with the wild type, i.e., non-transforming versus 
transforming, any given point mutation may be neutral or else quantitative in 
its effect, that is, the mutation may reduce but not totally eliminate 
transformability. Thus, according to a preferred embodiment of the invention, 

20 a deletion is created in a region of the cognate transgene which encodes an 
amino acid sequence required for transformation. Consonant with non- 
transformability, the smallest deletion possible so as to leave intact the bulk of 
the antigenicity of the transgene product is selected. 

The engineering of a cognate transgene deletion mutant that 

25 satisfies these criteria is facilitated by reports of structure-function relationship 
in oncogene-encoded proteins. Such reports serve to identify regions of 
oncoproteins that are essential for transformation, as opposed to regions which 
are either neutral or serve merely to modulate transformability. Although such 
reports are usually based on in vitro transformation assays, and are therefore 

30 independent of immune effects, these studies can be exploited to aid in the 
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construction of non-transforming dCTGs for use in the practice of the present 
invention. 

The deletion mutant is engineered to include at least a part of the 
region identified as critical for transformation. In those cases where essential 
5 amino acids have been identified, the deletion will span these residues. The 
engineering of any desired deletion can be readily accomplished by polymerase 
chain reaction (PCR) according to conventional PCR techniques, based upon the 
known nucleotide sequence of the unmutated cognate transgene. 

The following describes a representative protocol for deriving a 
10 non-transforming dCTG of the smallest possible deletion, for use in the practice 
of the present invention. A test dCTG, engineered on the basis of known or 
ascertained transformation-specific domains, and driven by the strongest possible 
promoter, is used to transfect murine 3T3 cells. A sister culture of 3T3 cells 
is also transfected, with non-deleted CTG. Each CTG or dCTG cell culture is 
15 inoculated into nude mice, in the absence of any treatment to render the cells 
non-dividing. Those dCTGs which do not yield tumors in the mice even after 
prolonged observation are then utilized as transgenes for the biopsied human 
cells which, upon transfection with the transgene, will serve as a cellular 
vaccine according to the practice of the present invention. The dCTGs are 
20 selected with the smallest deletion mutant consonant with non-transformabiiity. 

Some CTGs representing xenogenic proto-oncogenes may not be 
tumorigenic in the 3T3/nude mouse assay. For any such non-transforming 
CTG, it is not essential to generate a dCTG. However, even given non- 
tumorigenicity in nude mice, it may be desirable to opt for generation of a 
25 deletion mutant when the transgene is based upon a xenogenic proto-oncogene. 
In such cases, the deletion would be engineered so as to remove the 
homologous region to that deleted in the particular dCTG that corresponds to 
the deletion in the corresponding retroviral oncogene dCTG. 

Even though the transgene construct may comprise mutant 
30 oncogene or proto-oncogene DNA which is nontransforming, it is nevertheless 
preferable, as a safety measure, to treat the transfected cells to render them non- 
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dividing before inoculation back into the host. The cells are irradiated with a 
radiation dosage sufficient to render them non-dividing, 

Oncogenicity Assay of Cognate Transgenes 

As a further safety measure, the oncogenicity of a given dCTG 
5 is preferably thoroughly tested prior to infection of the human host cells which 
are used as cellular irnmunogens according to the practice of the present 
invention. For example, an oncogenicity testing regimen may take the form of 
three separate assays: (i) dCTG transfection of NIH 3T3 cells, followed by 
inoculation into nude mice; (ii) dCTG transfection of human fibroblasts, 

10 followed by inoculation into nude mice; and (iii) dCTG transfection of human 
fibroblasts, followed by an in vitro test of anchorage-dependent growth. In 
principle, all three should be negative to validate the use of any given dCTG in 
the vaccination method of the present invention. 

According to the oncogenicity assay (i), after stable transfection 

15 of NIH 3T3 cells with the test dCTG, the transfectants are inoculated into nude 
mice. Tumorigenicity of the transfectants in the mice is then evaluated 
according to standard protocols. 

According to oncogenicity assay (ii), human fibroblasts are 
transfected with the test dCTG as proposed in the above human immunization 

20 protocol. After stable dCTG transfection of human fibroblasts, however, rather 
than carrying out X-irradiation of the transfectants to render them non-dividing, 
followed by inoculation of the irradiated transfectants back into the human host, 
the transfectants are directly inoculated into nude mice as a direct test of 
tumorigenicity. Given the greater susceptibility of murine 3T3 cells to 

25 oncogenic transformation, vis a vis primary human or murine transfectants 
fibroblasts, assay (ii) is probably much less sensitive than assay (i), but does 
have the advantage of offering a direct test of dCTG oncogenicity in human 
cells. 

According to oncogenicity assay (iii), non-irradiated dCTG- 
30 transfected human fibroblasts are assayed for anchorage-dependent growth, i.e. 
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colony formation in soft agar, as a test of dCTG transforming potential in 
human cells. Anchorage independence, as defined by the ability of cells to 
grow when suspended in semisolid medium, is a common phenotype acquired 
by human tumor cells, particularly those tumor cells of mesenchymal origin, 
5 such as fibrosarcomas. While assay (iii) has no in vivo readout, it offers an 
independent test of the critical issue of dCTG oncogenicity in human cells. 

The oncogenicity assays are performed according to published 
protocols. Assay (i), comprising dCTG transfection of NIH 3T3 cells followed 
by inoculation into nude mice, may be performed according to the protocol of 
10 Stevens et al y Proc. Nail. Acad. Sci. USA (1988), 85:3875-3879, including 
DNA transfection by the calcium phosphate coprecipitation method of 
Manohaven et al , Carcinogenesis ( 1 985) , 6: 1295- 1 30 1 . Accordingly , NIH 3T3 
cells (7.5 X 10 5 cells per 100-mm dish) are exposed to a calcium phosphate- 
DNA coprecipitate (40 fig of genomic DNA plus 3 /xg of pSV2neo per dish) for 
15 4 hours. Two days later, each dish is trypsinized and reseeded into a 175-cm 2 
flask. For the next 10 days, cultures are selected in G418 (400 ng/ml) y and the 
flasks are then trypsinized and cells are replated in the same flask to disperse 
the G418-resistant colonies into a diffuse lawn of cells. Two days later, the 
cells are harvested and washed with serum-free medium prior to injection. One 
20 injection of 5 X 10 6 cells into the right flank and one injection of 1 X 10 7 cells 
into the left flank, each in a volume of 200 pi, are done on each nude mouse. 
Injection sites are monitored at 3- or 4-day intervals for 100 days. The sites are 
scored for the number of tumors induced per injection site. 

Oncogenicity assay (ii), whereby dCTG transfection of human 
25 fibroblasts followed by inoculation into nude mice, is carried out in the same 
manner as assay (i) except that for assay (ii) the human fibroblast transfectants 
are substituted for the murine 3T3 transfectants. 

Assay (iii), involves a test of the in vitro anchorage-dependent 
growth of dCTG-transfected human fibroblasts. The assay is carried out as 
30 described in Stevens et al., 7. Cancer Res. and Clin. Oncol. 1989, 115:118- 
128. 1 x 10 5 cells are seeded per 60-mm dish into 0.33% Noble agar over a 
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6-mI 0.5% agar base layer in Hams F10 supplemented with 6% fetal bovine 
serum. A portion of the agar suspension is diluted with Hams F10 plus 6% 
fetal calf serum to 200 cells/5 ml to determine the cloning efficiency of these 
cells when seeded into plastic 60-mm dishes. Agar dishes are fed with 1 ml 

5 Hams F10 supplemented with 6% fetal bovine serum on the 1st and 15th day 
after seeding. Four weeks after seeding, all agar colonies >75 /*m in diameter 
are counted and the colony counts are normalized to the plating efficiencies 
which aliquots of the initially seeded cells showed on plastic. This comparison, 
or normalization, of the agar colony counts to the plastic dish colony counts is 

10 useful in identifying and correcting for any mechanical artifacts which might 
result from the seeding into agar of dead cells that had persisted from the initial 
transfection treatment or from heat-induced cell death, which might have 
occurred while suspending cells in molten agar during the process of seeding the 
agar dishes. 

15 The following is a partial list of various deletions which, based 

upon published accounts of experiments with human or animal cells, are 
believed to render the identified CTG non-tumorigenic. 
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Engineering of Vectors for Host Cell Transfection 

The engineering of vectors for expression of a particular CTG. 
preferably a dCTG, is based on standard methods of recombinant DNA 
technology, i.e. insertion of the dCTG via the polylinker of standard or 
5 commercially available expression vectors. The dCTG is operably linked to a 
strong promoter. Generally speaking, a "strong" promoter is a promoter which 
achieves constitutively high expression of the dCTG in the transfected cells. 
Each promoter should include all of the signals necessary for initiating 
transcription of the relevant downstream sequence. These conditions are 
10 fulfilled, for example, by the pBK-CMV expression vector available from 
Stratagene Cloning Systems, La Jolla, CA (catalog no. 212209). The pBK- 
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CMV vector contains the cytomegalovirus (CMV) immediate early promoter. 
dCTGs xenogenic with respect to a particular target proto-oncogene may be 
isolated by conventional nucleic acid probing techniques, given the availability 
of a highly homologous probe represented by the cognate retroviral oncogene 
5 and/or the human proto-oncogene itself. 

Collection of Host Cells for Transfection 

The host cells which may be transfected to derive the cellular 
immunogens of the present invention must express class I MHC and be 
susceptible to isolation and culture. Fibroblasts express class I MHC and may 

10 be cultured. Accordingly, punch biopsies of host human skin are performed to 
harvest fibroblasts. Punch biopsies can be performed by a competent physician 
as a standard clinical procedure. Each biopsy yields a starting population of 1-2 
X 10 7 cells that would proliferate in culture. Methods for the preparation of 
tissue cultures of human fibroblasts are well developed and widely used. See, 

15 Cristofalo and Carpenter, J. Tissue Culture Methods (1980), 6:117-121, the 
entire disclosure of which is incorporated herein by reference. Essentially, skin 
obtained by punch biopsy is washed using an appropriate wash medium, finely 
minced and cultured in a suitable culture medium, such as Dulbecco's Modified 
Eagle Medium (DMEM), under C0 2 at 37°C. The cells are trypsinized with 

20 a trypsin solution and transferred to a larger vessel and incubated at 37 °C in 
culture fluid. 

Host Cell Transfectip n 

The expression vector carrying the dCTG is used to transfect 
biopsied host cells according to conventional transfection methods. One method 
25 of transfection involves the addition of DEAE-dextran to increase the uptake of 
the naked DNA molecules by a recipient cell. See McCutchin and Pagano, J. 
Natl. Cancer Inst. (1968) 41:351-7. Another method of transfection is the 
calcium phosphate precipitation technique which depends upon the addition of 
Ca ++ to a phosphate-containing DNA solution. The resulting precipitate 
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apparently includes DNA in association with calcium phosphate crystals. These 
crystals settle onto a cell monolayer; the resulting apposition of crystals and cell 
surface appears to lead to uptake of the DNA. A small proportion of the DNA 
taken up becomes expressed in a transfectant, as well as in its clonal descen- 
5 dants. See Graham ex ai, Virology (1973), 52:456-467 and Virology (1974), 
54:536-539. 

Preferably, transfection is carried out by cationic phospholipid- 
mediated delivery. In particular, polycationic liposomes can be formed from 
N-[l-(2,3-dioleyloxy)propyl]-N,N,N-trimethylammonium chloride (DOTMA) 

10 or related liposome-forming materials. See Feigner et aL, Proc. NatL Acad. 
ScL USA (1987) 84:7413-7417 (DNA- transfection); Malone et a/., Proc. NatL 
Acad. ScL USA (1989), 86:6077-6081) (RNA-transfection). One preferred 
technique utilizes the LipofectAMINE™ Reagent (Cat. No. 18324-012, Life 
Technologies, Inc., Gaithersburg, MD) which is a 3:1 (w/w) liposome 

15 formulation of the polycationic lipid 2,3-dioleyloxy-N- 
[2(sperminecarboxamido)ethyl-N ,N-dimethyl- 1 -propanaminium trifluoroacetate 
(DOSPA) (Chemical Abstracts Registry name: N-[2-({2,5-bis[(3- 
aminopropyl)amino]-l-oxypentyl}amino)ethyl]-N,N-dimethyl-2,3-bis(9- 
octadecenyloxyH -propanaminium trifluoroacetate), and the neutral lipid 

20 dioleoyl phosphatidylethanolamine (DOPE) in membrane filtered water. 
Transfection utilizing the LipofectAMINE™ Reagent is carried out according to 
the manufacturer's published protocol. The protocol (for Cat. No. 18324-012) 
provides for either transient or stable transfection, as desired. 

The advantage of transient expression is its rapidity, i.e. there is 

25 no requirement for cellular proliferation to select for stable integration events. 
This rapidity could conceivably be of major clinical importance, in cases of an 
already metastatic tumor burden, wherein the weeks required for selection of 
stable transfectants may simply not be available to the clinician. 

There are, nonetheless, two general disadvantages to the use of 

30 transient transfection. The first is that expression usually peters out after a few 
days, in contrast to the continual expression in the case of stable transfection. 
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This is not particularly crippling in terms of our immunization protocol. The 
inoculated, irradiated cells used for immunization would likely not survive in 
vivo for more than 4 or 5 days, in any case. Thus the nominal advantage 
accruing to stable transfection, that of a long-duration expression by the progeny 
5 of the parental inoculated cell, is not of particular relevance in the case of the 
immunizing regime described herein, which is based on the use of non-dividing, 
probably short-lived cells. 

A second disadvantage of transient transfection resides in the fact 
that it yields a cell population, only a subset of which has actually been 

10 transfected and thus expresses the protein encoded by the transgene. This 
problem is obviated in the case of stable transfection, wherein over time one can 
develop a pure population of transfectants via selection for a resistance marker, 
such as neo, under conditions of clonal proliferation of the initial stable 
transfectants, i.e. daughter cells of transiently transfected cells lack the 

15 transgene, in contrast to the case with stable transfectants. In the situation 
where there is sufficient time to effect immunization based on stably transfected 
cells, the progeny of all transfected clones would be utilized, not just the 
progeny of a single clone, as is sometimes done for detailed biochemical and 
molecular analyses of gene expression. Clearly the more clones utilized, the 

20 more quickly one can arrive at the requisite number of cells to be used for 
immunization. 

Percentage of Cells Exhibiting dCTG Expression 

The percentage of cells exhibiting dCTG expression may be 
determined by an immunohistology assay. In this procedure, a small number 

25 of cells ( - 500) from the harvested pellet following centrifugation of transfected 
cells are deposited on a cover slip and fixed with cold acetone. At this point, 
a standard immunohistological assay is carried out with the cells on the cover 
slip, i.e. addition of a primary monoclonal antibody reactive to the dCTG- 
encoded protein, followed by the addition of a developing antibody, e.g. a 

30 fluorescent tagged antibody reactive to the primary monoclonal antibody. 
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Measurement of the percentage of cells scoring as dCTG-positive in the 
fluorescent assay allows a determination of the number of positive transfectants 
in the starting culture, and thus the number of total cells to be used for 
immunization to arrive at the desired number of dCTG-positive cells to be 
5 inoculated in the patient. 

If, as would be almost certain, the percentage of cells scoring as 
dCTG-positive is less than one hundred percent, one can simply increase the 
number of cells to be used for immunization, so as to include the desired 
number of transfectants. The non-transfected cells in the immunizing population 
10 would simply represent x-irradiated, autologous fibroblasts that would constitute 
no danger to the patient. 

Transfectant Irradiation 

Prior to return to the host, the transfected cells are preferably 
irradiated. The transfectants are irradiated with a radiation dose sufficient to 
15 render them non-dividing, such as a dose of 25 By or 2500R. The cells are 
then counted by trypan blue exclusion, and about 2 X 10 7 irradiated 
transfectants are resuspended in a volume of 0.2-0.4 ml of Hanks Balanced Salt 
Solution. 



Vaccination Procedure 

The transfected cells are returned to the host to achieve 
vaccination. The cells may be reimplanted at the same body site from which 
they were originally harvested, or may be restored to a different site. 

It is the object of the present invention to generate a systemic 
tumor immune response, so as to fight metastasis formation wherever any 
metastases are found. Accordingly, there is no reason to inject the transfected 
cells at the same body site from which they were taken. Intramuscular or 
subcutaneous inoculation at a distal site would suffice to yield a systemic 
response. Thus, patients are preferably vaccinated by subcutaneous inoculation 
of the transfected cells. 
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For s-crc overexpression associated with colon carcinoma, partial 
venous inoculation is preferred, as the liver is a frequent site of metastases. For 
vaccinating against breast cancers and lymphomas, systemic immunization is 
preferred. 

5 As a general rule, it is desirable to generate the strongest immune 

response consistent with clinical monitoring of no adverse side effects, i.e. 
multiple rounds of inoculation with, for example 10 7 cells, at each round. The 
number of rounds of inoculation is selected accordingly. The efficacy of the 
inoculation schedule may be monitored by a delayed hypersensitivity reaction 

10 administered to the patient. A course of about up to 10 inoculations, at 2-3 
week intervals, may be utilized. It may be appreciated that the inoculation 
schedule may be modified in view of the immunologic response of the 
individual patient, as determined with resort to the delayed-type hypersensitivity 
(DTH) reaction. 

15 Patient Response Monitoring bv Delaved-tvpe Hypersensitivit y Reaction 

Patients are assessed for reactivity to the irradiated transfectants 
by a test of skin reactivity in a DTH reaction. DTH has been used clinically 
(Chang et ai (1993), Cancer Research 53:1043-1050). To measure reactivity 
to the autologous irradiated transfectants, 10 4 - 10 6 cells in a volume of 0.1 ml 

20 Hanks buffered saline solution (HBSS) are inoculated intradermal^ into the 
host. Induration is measured 48 hours later, as an average of two perpendicular 
diameters (responses of greater than £2 mm is considered positive). 

One advantage to the DTH assay is that it can independently 
assess the induction of T cell reactivity to (i) the transfectants used for 

25 immunization (i.e. the set of 5 or more dCTGs chosen for immunization 
purposes, each containing non-self determinants) and (ii) transfectants, as 
transfected with the human dCTG itself containing only self determinants. 
Thus, the induction of reactivity to the transfectants used for immunization 
establishes that the immunizing transfectants are in fact immunogenic, that is, 

30 the patient has not exhibiting a much weakened capacity for immune response. 
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If the patient is demonstrably capable of response to the immunizing 
transfectants, then skin testing with the dCTG (human) transfectants would 
establish whether or not reactivity to the human proto-oncogene encoded product 
had been induced. According to the practice of the invention, inoculation of the 
5 immunizing transfectants would continue for at least as long as the induction of 
reactivity to the human proto-oncogene-encoded protein occurs. 

The practice of the invention is illustrated by the following 
nonlimiting examples. 



Example 1 

10 Immunization of Chickens Against c-.rrrt527)-Induced 

Tumors Bv Vaccination with \-src DNA 



A. Genes 

The oncogene c-src(527) is an activated form of chicken c-src. 
Its protein product pp60 c ' $rcf527) differs from the protein product of c-src, pp60° 

15 src , by only a single amino acid substitution, phenylalanine for tyrosine at 
residue 527 (Kmiecik and Shalloway, (1987) Cell 49, 65-73). This substitution 
eliminates the negative regulatory influence exerted on pp60 c src phosphokinase 
activity by the enzymatic phosphorylation of the position 527 tyrosine. The 
protein product of v-src, pp60 v src , shows a number of sequence differences with 

20 pptiOF™ (Takeya and Hanafusa, (1983) Cell 32, 881-890), including scattered 
single amino acid substitutions within the first 514 residues and a novel C 
terminus of 12 amino acids (residues 515-526), in place of the nineteen C 
terminal amino acids of pp60 csrc (residues 515-533). Both the v-5rc-positive 
plasmid, pMvjrc, and thee- j/r(527)-positiveplasmid, pcsrc527, were originally 

25 shown (Kmiecik and Shalloway, (1987) Cell 49, 65-73) to transform murine 
NIH 3T3 cells in culture. However, the v-$rc-induced transformants exhibited 
a more rapid or more extensive colony growth in soft agarose than the c- 
jrc(527)-induced transformants, as well as a usually shorter latency of tumor 
formation in nude mice (id,). 
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B. Plasmids 

1. pvSRC CI 

The pVSRC-Cl plasmid was prepared as described by Halpern 
et al. y (1991) Virology 180, 857-86. Essentially, the plasmid was derived from 
5 the pRL\y/r plasmid (Halpern et aL, (1990) Virology 175, 328-331) by 
subcloning the v-*/r(+) Xhol-EcoRL fragment of the latter into the multiple 
cloning sequence of pSP65 (Melton et al , (1984) Nucleic Acids Res. 12, 7035- 
7056) which had been cleaved with Sail and £o?RI; since ligation of the Xhol 
overhang at the Sail site destroys both recognition sequences, subsequent 

10 removal of the v-jrc( + ) insert from the vector was achieved by digestion with 
EcoKL and with /fi/zdlll, which cleaves at a position in the multiple cloning 
sequence adjacent to the San site. The pVSRC-Cl plasmid was restricted with 
EcoRl and Hindlll, so as to liberate the tumorigenic insert. This insert included 
the v-src oncogene of the subgroup A strain of Prague RSV, as flanked 

15 downstream by a portion of the long terminal repeat (LTR) of RSV (from the 
5' stan of the LTR, to the single EcoJd site). 

2. pMvsrc 

The pMvsrc plasmid was generously provided by Dr. David 
Shalloway, Cornell University, Ithaca, NY. The plasmid is prepared according 

20 to Johnson et al, (1985) MoL Cell. Biol 5, 1073-1083. Briefly, the 3.1-kb 
BamHl-Bgfll Schmidt Ruppin A v-src fragment from plasmid pN4 (Iba et al. , 
(1984) Proc. Nat. Acad. Sci. USA 81, 4424-4428) is inserted into the pEVX 
plasmid (Kriegler et al. , (1984) Cell 38,483-491) at a Bg/U site lying between 
two Moloney murine leukemia virus (MoMLV) long terminal repeats (LTRs). 

25 This fragment contains 276 bp of pBR322 DNA from the pBR322 BamUl to 
Sail sites followed by 2.8 kb of Rous sarcoma virus (RSV) DNA from the Sail 
site that is about 750 bp upstream of the env termination codon down to the 
Nrul site that is about 90 bp downstream of the \-src termination codon. (The 
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Nrul site is converted to a Bg/ll site in the construction of pN4.) Ligation is 
performed by using a 10:1 insert-vector DNA fragment molar ratio. 

The pMvjrc plasmid was restricted with Nhel, so as to liberate 
a tumorigenic fragment. The fragment included the v-src oncogene of the 
5 subgroup A strain of Schmidt-Ruppin RSV, as flanked upstream by most of the 
Moloney murine leukemia virus (MoMLV) LTR (from the Nhel site near the 
5' start of the LTR, to the 3' end of this LTR) and downstream by a small 
portion of the MoMLV LTR (from the 5' start to the Nhel site). 

3. pcrrc527 

The pcsrc527 plasmid is prepared according to Kmiecik and 
Shalloway, (1987) Cell 49, 65-73. Briefly, a plasmid is constructed by cleaving 
expression vector pEVX (Kriegler etal, (1984) Cell 38,483-491 at its unique 
Bg\W site lying between two MoMLV LTRs and inserting the 3.2 kilobase (kb) 
pair BamHl-Bglll hybrid src fragment from plasmid pHB5 in the proper 
orientation. This fragment contains sequences from pBR322, the SRA env 3' 
region, SRA v-jrc, src from recovered ASV, and chicken c-src. The BglU site 
is generated by insertion of a linker at the Sad site about 20 bp downstream 
from the c-src termination codon. The restriction map of pMHB5 contains the 
MoMLV splice donor about 60 bp downstream from the 3 'end of the upstream 
LTR and the v-src splice acceptor about 75 bp upstream from the src ATG. 

Plasmid pMHB5527 is constructed by inserting the synthetic 
double- stranded DNA oligomer 

5' C CAGTTCCAGCCTGGAGAG AAC CTATA (SEQ ID NO : 1 ) 3' 

3' TCGGGGTCAAGGTCGGACCTCTCTTGGATATCTAG (SEQ ID NO: 2) 5' 

25 into pMHB5 between the Banll site at c-src codon 524 and the downstream 
unique 5^111 site. This alters the TAC Tyr 527 codon to a TTC Phe codon 
while preserving the remaining c-src coding region. Equimolar amounts of the 
double-stranded oligomer and three gel-purified tandem restriction fragments 
from pMHB5 are ligated in one reaction, which contains the following: the 



10 



15 



20 
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oligomer with Banll and Bglll complementary ends, the 3 kb Bglll-Bgll (BgU 
in the pEVX ampicillin resistance gene) partial digest fragment, the adjacent 6. 1 
kb Bgll-Bgll (downstream Bg/7 in c-src) fragment, and the 0.38 kb Bgll-Banll 
(Banll at c-src codon 524) fragment. 
5 Plasmid pwrc527 is constructed by replacing the 2 kb Sail (in 

env)-Mlul (in c-src) fragment in plasmid pMHB5527, with the homologous 
fragment from plasmid p5H. This fragment contains the coding sequence for 
the c-src amino region (codons 1 to 257) that have been isolated by molecular 
cloning of a c-src provirus and previously shown by sequencing to contain 

10 authentic c-src sequence without the mutation at codon 63 (Levy et al , (1986) 
Proc. Natl. Acad. Sci USA 83, 4228-4232). Equimolar amounts of 
complementary gel-purified SaR-Mlul fragments from p5H and the other 
plasmids are ligated. 

The pcsrc527 plasmid was restricted with Nhel, so as to liberate 

15 a tumorigenic fragment. The tumorigenic fragment included the c-$rc(527) 
oncogene, as flanked by the same LTR complement as in pMvsrc. 

C. Animals 

Chickens of two closed lines, SC and TK, were utilized. These 
lines differ at the major histocompatibility (B) complex for the SC line, 
20 B^/B 11 for the TK line). Embryonated eggs were obtained from Hyline 
International (Dallas Center, 1A). All chickens were hatched at the University 
of New Hampshire Poultry Research Farm and housed in isolation. 

D. Tumor Induction by Plasmid DNA 

Tumors were induced by subcutaneous inoculation in the wing 
25 web of a .src-positive plasmid according to the technique described by Fung et 
al (1983) Proc. Natl Acad. Sci. USA 80, 353-357 and Halpern et al, (1990) 
Virology 175, 328-331. Of the three tumorigenic plasmids utilized here, all 
were adjusted, prior to inoculation, to a concentration of 100 of enzyme- 
restricted DNA per 100 y\ of phosphate-buffered saline. The conditions of 
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inoculation used for particular experiments (age of chicken at time of 
inoculation, amount of plasmid, etc.) are indicated below. 

E. Growth of Primary (wing web) Tumors in TK or SC Chickens 

Inoculated with pVSRC-Cl. oM vsrc or pcsrcSll 

5 Individual 1 -day-old chickens of line TK or of line SC were 

inoculated with 100 fig of either pVSRC-Cl, pM vsrc or pcsrc527. The mean 
tumor diameter (mm) at a particular time point and for any one group of TK or 
SC line chickens inoculated with an individual s/r-positive construct was 
computed as the sum of the diameters of the primary tumors divided by the 

10 number of chickens surviving to that point. The results are shown in Fig. 1A 
(line TK) and Fig. IB (line SC). The ratios at each time point show, for a 
particular group, the number of chickens bearing palpable tumors to the total 
number of survivors to that point (standard typeface for pcsrc527, italics for 
pVSRC-Cl, bold typeface for pMVjrc). Error bars (unless obscured by the 

15 symbol) indicate standard error. 

F. Growth of Challenge (wing web) Tumors in Test and Control 

Line TK Chickens Under Conditions of Priming and Homologous 
Challenge with pcsrc527. or Priming and Homologous Challenge 
with pVSRC-Cl 

20 Growth of challenge (wing web) tumors in test and control line 

TK chickens was determined under conditions of (i) priming and homologous 
challenge with pcsrc527, or (ii) priming and homologous challenge with 
pVSRC-Cl. Test chickens were primed at 1 day posthatch with 100 /*g of 
construct; test and control chickens were challenged at five weeks posthatch 

25 with 200 /xg of construct. The mean challenge tumor diameter was computed 
as described in the preceding section. At each time point the ratio of chickens 
bearing palpable challenge tumors to total number of survivors to that point is 
indicated for priming and homologous challenge with pcsrc527 (Fig. 2, panel 
A) and priming and homologous challenge with pVSRC-Cl (Fig. 2, panel B) 
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(standard typeface for control group, bold typeface for test group). The 
statistical comparison between the mean challenge tumor diameters of the test 
versus the control group at a particular time point was made using a two-tailed 
student's t test, *(p<0.05), **(p<0.01), ***(p< 0.001). The statistical 
5 comparison between the ratios of chickens bearing palpable challenge tumors to 
total number of survivors of the test versus the control group at a particular 
time point was made using a chi-squared test; the paired ratios are underlined 
for only those time points where p<0.05. Error bars indicate standard error. 



G. Growth of Challenge (wing web) Tumors in Test and Control 

10 line TK chickens under Conditions of Priming with pVSRC-Cl 

and Heterologous Challenge with pcsrc527. or Priming with 
pcsrc527 and Heterologous Challenge with pVSRC-Cl 
Growth of challenge (wing web) tumors in test and control line 
TK chickens, was determined under conditions of (i) priming with pVSRC-Cl 

15 and heterologous challenge with pcyrc527, or (ii) priming with pcrrc527 and 
heterologous challenge with pVSRC-Cl. Test chickens were primed at 1 day 
posthatch with 100 fig of construct; test and control chickens were challenged 
at five weeks posthatch with 200 /xg of construct. The mean challenge tumor 
diameter was computed as described in Section E. At each time point the ratio 

20 of chickens bearing palpable challenge tumors to total number of survivors to 
that point is indicated for priming with pVSRC-Cl and heterologous challenge 
with pcsrc527 (Fig. 3, panel A) and priming with pcjrc527 and heterologous 
challenge with pVSRC-Cl (Fig. 3, panel B) (standard typeface for control 
group, bold typeface for test group). Statistical comparisons were made 

25 between test and control groups at a particular time point as described in the 
preceding section [*(p<0.05), **(p<0.01), ***(p<0.001), for the student's 
t test], and the paired ratios are underlined for only those time points where, in 
the chi-squared test, p<0.05. Error bars indicate standard error. 
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H. Discussion 

In a direct comparison of the growth of tumors induced in line 
TK by either pMvsrc or pVSRC-Cl, a similar pattern of relatively rapid 
regression was observed. This result established that the difference in LTR 
5 complement between these two v-src positive constructs did not exert a major 
influence on the tumor growth pattern in the TK line (Fig. 1, panel A). By 
contrast, much more extensive and persistent tumor growth resulted from 
inoculation of TK chickens with the pcjrc527 construct (Fig. 1, panel A). The 
relatively greater growth capacity of tumors induced by this construct indicated 
10 that in the TK line, the c-j/r(527) oncogene is much more highly tumorigenic 
than the v-src oncogene. This difference did not, however, generalize to the SC 
line (Fig. 1, panel B). The SC line was chosen for comparison with the TK 
line on the basis of earlier observations (Halpern et al, (1993) Virology 197, 
480-484) that v-src DNA-induced tumors engender a much weaker tumor 
15 immune response in line SC than in line TK. Whereas the growth of pcsrc521- 
induced primary tumors was virtually indistinguishable in the two lines, the 
growth of the v-sroinduced tumors was considerably greater in the SC than in 
the TK line (Fig. 1). Thus v-src, but not c -src{521), gives rise to primary 
tumors whose growth patterns differ in the two lines analyzed here. 
20 Only minimal protection against homologous challenge was 

observed under conditions of priming to c-src(527) DNA, indicative of the 
induction of a relatively weak tumor immune response (Fig. 2, panel A; a 
statistically significant lowering of challenge tumor growth in the test versus the 
control chickens was observed at only one time point). By contrast, the v-src 
25 DNA-primed chickens showed excellent protection against the homologous 
tumor challenge (Fig. 2, panel B). 

Priming with v-src DNA engenders a relatively greater degree of 
protection against challenge with c-src(527) DNA, than that afforded by priming 
with c-j/r(527) DNA itself (Fig. 3, panel A). The degree of protection was 
30 weaker than that determined (Fig. 2. panel B) for the case of priming and 
homologous challenge with v-src DNA. Only marginal protection was 
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observed, however, when the heterologous challenge protocol was carried out 
in the reverse order (Fig. 3, panel B). These results demonstrate that induction 
of reactivity to an antigenicity specified in tumor cells by an overexpressed 
proto-oncogene can confers tumor immunity. 



5 Example 2 

Vaccination Protocol 

The following is a representative vaccination protocol according 
to the present invention. 

A. Skin Punch Biopsy 

10 A punch biopsy of skin is obtained by a trained physician 

following standard medical practice. 



B. Preparation of Primary Fibroblast Culture 

Under sterile conditions, the skin obtained by punch biopsy is put 
in a tube with 10 ml of the following wash medium: Dulbecco's Modified 

15 Eagle Medium (DMEM), containing sodium bicarbonate (30 ml/liter of a 5.6% 
solution) and penicillin/streptomycin (2 ml/liter of a pen-strep stock solution 
containing 5000 units penicillin and 5000 /xg of streptomycin/ml, pH 7.2-7.4.). 
In a sterile hood, the skin biopsy is added to a Petri dish, and then transferred 
several times to new Petri dishes containing the same wash medium. The 

20 biopsy is then finely minced with two scalpels, and 2-4 pieces (< 1 mm 3 ) of the 
minced biopsied are placed in the middle part of one or more T25 flasks. The 
flask is placed in a tissue culture incubator at 37 °C for one half hour with the 
cap firmly closed, then opened for 10 minutes. The following culture medium 
is prepared: DMEM containing sodium bicarbonate; antibiotics; and 10% fetal 

25 calf serum containing 2.5 fig/ml fungizone, 40 ^g/ml gentamicin, and 1% 
glutamine( 3% W/V). Two ml of the culture medium is then added to the flask, 
and the flask is incubated at 37°C (5% C0 2 ), with the cap lightly unscrewed. 
The flask is left for three days without moving so as to obtain adhesion of the 
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separate pieces of skin to the plastic. Afterwards, the medium is changed two 
times per week over a 3-4 week period always adding 2-3 ml of medium. To 
trypsinize the skin cell culture, one needs zones of confluence. After aspirating 
the culture medium, 5 ml of the Puck's Saline A/EDTA solution (0.4 g EDTA 
5 to 1 liter of Puck's Solution A) is added and immediately aspirated. Then 1 ml 
of trypsin solution (0.05/0.02% trypsin in PBS, without Ca++ or Mg++ ) is 
added and incubated for 5 min at 37 °C, at which time 2 ml of culture fluid is 
added to stop the action of the trypsin. The cells are then transferred to a larger 
flask (775) and incubated at 37 °C in 15 ml of culture fluid, which is changed 
10 every 2 days. 

C. Fibroblast Transfection 

The fibroblasts (2 X 10 5 cells) are washed twice in DMEM 
without serum or antibiotics. A LipofectAMINE™-DNA solution is prepared 
by mixing in tube #1 mix 400/iI DMEM and 10^1 of dCTG vector DNA 

15 (l^g/ul). In tube #2, 400 /xl DMEM and 25 Ml of Lipofect AMINE Reagent 
(Life Technologies, cat. no. 18324-012) are mixed. The contents of tube #1 
and #2 are mixed together and are then left sitting at room temperature for 30 
hours. Then, 3.2 ml of the Lipofect AMINE™- DNA solution is added to the 
cells. The cells are incubated for six hours at 37°C, washed once with Hank's 

20 Balanced Salt Solution, and then refed with growth medium and incubated for 
an additional 24 hours at 37 °C 

D. Transfectant Irradiation 

Transfectants are irradiated to a dose of 25 By or 2500R. the 
cells are then counted by trypan blue exclusion. 2 X 10 7 irradiated transfectants 
25 are resuspended in a volume of 0.2-0.4 ml of Hanks Balanced Salt Solution. 

E. Vaccination 

Patients are vaccinated by subcutaneous inoculation of 2 X 10 7 
irradiated cells at 2-3 week intervals. A shorter or longer regimen is used, 
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depending upon the results of delayed type hypersensitivity (DTH) reaction 
monitoring (described below). 

F. Patient Assessment by DTH Monitoring 

Patients are assessed for reactivity to the irradiated transfectants 
5 by a test of skin reactivity in a DTH reaction, as described by Chang et ai 
(1993), Cancer Research 53:1043-1050. To measure reactivity to the 
autologous irradiated transfectants, 10 4 - 10* transfected irradiated cells in a 
volume of 0.1 ml HBSS are inoculated intradermal ly. Induration is measured 
48 hours later, as an average of two perpendicular diameters. Responses of 
10 greater than 2 mm are considered positive. 

Example 3 
\-fnxc Transfection of Murine Fibroblasts 

A. Vector Preparation 

The v-myc retroviral oncogene of avian myelocytomatosis virus 
15 MC29 (Land et al. (1983), Nature 304:596-602) was obtained from the 
American Type Culture Collection, Rockville, MD, 20852, as the pSVv-myc 
vector (ATCC No. 45014). The v-myc-positive EcoKl-Kpnl fragment of pSVv- 
myc was ligated into the polylinker sites of the pBK-CMV plasmid (Stratagene 
Cloning Systems, La Jolla, CA). 

20 B. Cell Transfection 

Stable transfection using the pBK-CMV-v-royc vector was carried 
out on a line of A31 fibroblasts (Balb/c origin), obtained from the ATCC. 2 
X 10 5 cells were seeded in a 100 mm/dish and allowed to grow for 18-20 h 
(RPMI 1640 medium and 10% fetal bovine serum), at which time the cells 

25 reached 50-70% confluence. The cells were then washed twice in Dulbecco's 
Modified Eagles Medium (without serum or antibiotics). A LipofectAMINE™- 
DNA solution was prepared according to Example 2.C, with the pBK-CMV-v- 
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rnyc vector DNA, and 3.2 ml of the LipofectAMINE™-DNA solution added to 
the cells. The cells were then incubated for 6 hours at 37°C, washed once with 
Hank's Balanced Salt Solution, and then refed with the growth medium and 
incubated for an additional 24 hour at 37 °C. Thereafter, the cells were fed 
5 once every two days with growth medium containing 250 /xg/ml geneticin 
(G418; Gibco BRL cat. no. 11811) as the selective marker. Within two weeks, 
colonies were picked and expanded into permanent cell lines. The cells were 
then washed and collected by centrifugation. 

It should be noted that the procedure for transient transfection is 
10 the same, through the point of incubation with the Lipofectamine™-DNA 
solution. Thereafter, the cells are washed and incubated for 72 hours in growth 
medium. 



All references cited with respect to synthetic, preparative and analytical 
procedures are incorporated herein by reference. 
15 The present invention may be embodied in other specific forms without 

departing from the spirit or essential attributes thereof and, accordingly, 
reference should be made to the appended claims, rather than to the foregoing 
specification, as indication the scope of the invention. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Allegheny University of the Health Sciences 

Halpern, Michael S. 
England, James M. 

(ii) TITLE OF INVENTION: CANCER VACCINE 

(iii> NUMBER OF SEQUENCES: 14 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Seidel , Gonda, Lavorgna 6 Monaco, P.C. 

(B) STREET: Suite 1800, Two Penn Center Plaza 

(C) CITY: Philadelphia 

(D) STATE: PA 

(E) COUNTRY: USA 

(F) ZIP: 19102 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

<C) OPERATING SYSTEM; PC - DOS /MS - DOS 

(D) SOFTWARE: Patentln Release #1.0 , Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 
<C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/010,262 

(B) FILING DATE: 19-JAN-1996 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Monaco, Daniel A. 

(B) REGISTRATION NUMBER: 30,480 

(C) REFERENCE /DOCKET NUMBER: 7933-33 PC 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (215) 568-8383 

(B) TELEFAX: (215) 568-5549 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
CCAGTTCCAG CCTGGAGAGA ACCTATA 27 
(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
GATCTATAGG TTCTCTCCAG GCTGGAACTG GGGCT 3 5 

<2> INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1599 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 



GAGACTGTGC 


CCTGTCCACG 


GTGCCTCCTG 


CATGTCCTGC 


TGCCCTGAGC 


TGTCCCGAGC 


60 


TAGGTGACAG 


CGTACCACGC 


TGCCACCATG 


AATGAGGTGT 


CTGTCATCAA 


AGAAGGCTGG 


120 


CTCCACAAGC 


GTGGTGAATA 


CATCAAGACC 


TGGAGGCCAC 


GGTACTTCCT 


GCTGAAGAGC 


180 


GACGGCTCCT 


TCATTGGGTA 


CAAGGAGAGG 


CCCGAGGCCC 


CTGATCAGAC 


TCTACCCCCC 


240 


TTAAACAACT 


TCTCCGT AG C 


AGAATGCCAG 


CTGATGAAGA 


CCGAGAGGCC 


GCGACCCAAC 


300 


ACCTTTGTCA 


TACGCTGCCT 


GCAGTGGACC 


ACAGTCATCG 


AGAGGACCTT 


CCACGTGGAT 


360 


TCTCCAGACG 


AGAGGGAGGA 


GTGGATGCGG 


GCCATCCAGA 


TGGTCGCCAA 


CAGCCTCAAG 


420 


CAGCGGGCCC 


CAGGCGAGGA 


CCCCATGGAC 


TACAAGTGTG 


GCTCCCCCAG 


TGACTCCTCC 


480 


ACGACTGAGG 


AGATGGAAGT 


GGCGGTCAGC 


AAGGCACGGG 


CTAAAGTGAC 


CATGAATGAC 


540 


TTCGACTATC 


TCAAACTCCT 


TGGCAAGGGA 


ACCTTTGGCA 


AAGTCATCCT 


GGTGCGGGAG 


600 


AAGGCCACTG 


GCCGCTACTA 


CGCCATGAAG 


ATCCTGCGAA 


AGGAAGTCAT 


CATTGCCAAG 


660 


GATGAAGTCG 


CTCACACAGT 


CACCGAGAGC 


CGGGTCCTCC 


AGAACACCAG 


GCACCCGTTC 


720 


CTCACTGCGC 


TGAAGTATGC 


CTTCCAGACC 


CACGACCGCC 


TGTGCTTTGT 


GATGGAGTAT 


780 


GCCAACGGGG 


GTGAGCTGTT 


CTTCCACCTG 


TCCCGGGAGC 


GTGTCTTCAC 


AGAGGAGCGG 


840 


GCCCGGTTTT 


ATGGTGCAGA 


GATTGTCTCG 


GCTCTTGAGT 


ACTTGCACTC 


GCGGGACGTG 


900 


GTATACCGCG 


ACATCAAGCT 


GGAAAACCTC 


ATGCTGGACA 


AAGATGGC C A 


CATCAAGATC 


960 


ACTGACTTTG 


GCCTCTGCAA 


AGAGGGCATC 


AGTGACGGGG 


CCACCATGAA 


AACCTTCTGT 


1020 


GGGACCCCGG 


AGTACCTGGC 


GCCTGAGGTG 


CTGGAGGACA 


ATGACTATGG 


CCGGGCCGTG 


1080 


GACTGGTGGG 


GGCTGGGTGT 


GGTCATGTAC 


GAGATGATGT 


GCGGCCGCCT 


GCCCTTCTAC 


1140 


AACCAGGACC 


ACGAGCGCCT 


CTTCGAGCTC 


ATCCTCATGG 


AAGAGATCCG 


CTTCCCGCGC 


1200 


ACGCTCAGCC 


CCGAGGCCAA 


GTCCCTGCTT 


GCTGGGCTGC 


TTAAGAAGGA 


CCCCAAGCAG 


1260 


AGGCTTGGTG 


GGGGGCCCAG 


CGATGCCAAG 


GAGGTCATGG 


AGCACAGGTT 


CTTCCTCAGC 


1320 


ATCAACTGGC 


AGGACGTGGT 


CCAGAAGAAG 


CTCCTGCCAC 


CCTTCAAACC 


TCAGGTCACG 


1380 


TCCGAGGTCG 


ACACAAGGTA 


CTTCGATGAT 


GAATTTACCG 


CCCAGTCCAT 


CACAATCACA 


1440 


CCCCCTGACC 


GCTATGACAG 


CCTGGGCTTA 


CTGGAGCTGG 


ACCAGCGGAC 


CCACTTCCCC 


1500 
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CAGTTCTCCT ACTCGGCCAG CATCCGCGAG TGAGCAGTCT GCCCACGCAG AGGACGCACG 1560 
CTCGCTGCCA TCACCGCTGG GTGGTTTTTT ACCCCTGCC 1599 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4530 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 



AATTCTCGAG 


CTCGTCGACC 


GGTCGACGAG 


CTCGAGGGTC 


GACGAGCTCG 


AGGGCGCGCG 


60 


CCCGGCCCCC 


ACCCCTCGCA 


GCACCCCGCG 


CCCCGCGCCC 


TCCCAGCCGG 


GTCCAGCCGG 


120 


AGCCATGGGG 


CCGGAGCCGC 


AGTGAGCACC 


ATGGAGCTGG 


CGGCCTTGTG 


CCGCTGGGGG 


180 


CTCCTCCTCG 


CCCTCTTGCC 


CCCCGGAGCC 


GCGAGCACCC 


AAGTGTGCAC 


CGGCACAGAC 


240 


ATGAAGCTGC 


GGCTCCCTGC 


CAGTCCCGAG 


ACCCACCTGG 


ACATGCTCCG 


CCACCTCTAC 


300 


CAGGGCTGCC 


AGGTGGTGCA 


GGGAAACCTG 


GAACTCACCT 


ACCTGCCCAC 


CAATGCCAGC 


360 


CTGTCCTTCC 


TGCAGGATAT 


CCAGGAGGTG 


CAGGGCTACG 


TGCTCATCGC 


TCACAACCAA 


420 


GTGAGGCAGG 


TCCCACTGCA 


GAGGCTGCGG 


ATTGTGCGAG 


GCACCCAGCT 


CTTTGAGGAC 


480 


AACTATGCCC 


TGGCCGTGCT 


AGACAATGGA 


GACCCGCTGA 


ACAATACCAC 


CCCTGTCACA 


540 


GGGGCCTCCC 


CAGGAGGCCT 


GCGGGAGCTG 


CAGCTTCGAA 


GCCTCACAGA 


GATCTTGAAA 


600 


GGAGGGGTCT 


TGATCCAGCG 


GAACCCCCAG 


CTCTGCTACC 


AGGACACGAT 


TTTGTGGAAG 


660 


GACATCTTCC 


ACAAGAACAA 


CCAGCTGGCT 


CTCACACTGA 


TAGACACCAA 


CCGCTCTCGG 


720 


GCCTGCCACC 


CCTGTTCTCC 


GATGTGTAAG 


GGCTCCCGCT 


GCTGGGGAGA 


GAGTTCTGAG 


780 


GATTGTCAGA 


GCCTGACGCG 


CACTGTCTGT 


GCCGGTGGCT 


GTGCCCGCTG 


CAAGGGGCCA 


840 


CTGCCCACTG 


ACTGCTGCCA 


TGAGCAGTGT 


GCTGCCGGCT 


GCACGGGCCC 


CAAGCACTCT 


900 


GACTGCCTGG 


CCTGCCTCCA 


CTTCAACCAC 


AGTGGCATCT 


GTGAGCTGCA 


CTGCCCAGCC 


960 


CTGGTCACCT 


ACAACACAGA 


CACGTTTGAG 


TCCATGCCCA 


ATCCCGAGGG 


CCGGTATACA 


1020 


TTCGGCGCCA 


GCTGTGTGAC 


TGCCTGTCCC 


TACAACTACC 


TTTCTACGGA 


CGTGGGATCC 


1080 


TGCACCCTCG 


TCTGCCCCCT 


GCACAACCAA 


GAGGTGACAG 


CAGAGGATGG 


AACACAGCGG 


1140 


TGTGAGAAGT 


GCAGCAAGCC 


CTGTGCCCGA 


GTGTGCTATG 


GTCTGGGCAT 


GGAGCACTTG 


1200 


CGAGAGGTGA 


GGGCAGTTAC 


CAGTGCCAAT 


ATCCAGGAGT 


TTGCTGGCTG 


CAAGAAGATC 


1260 


TTTGGGAGCC 


TGGCATTTCT 


GCCGGAGAGC 


TTTGATGGGG 


ACCCAGCCTC 


CAACACTGCC 


1320 


CCGCTCCAGC 


CAGAGCAGCT 


CCAAGTGTTT 


GAGACTCTGG 


AAGAGATCAC 


AGGTTACCTA 


1380 


TACATCTCAG 


CATGGCCGGA 


CAGCCTGCCT 


GACCTCAGCG 


TCTTCCAGAA 


CCTGCAAGTA 


1440 


ATCCGGGGAC 


GAATTCTGCA 


CAATGGCGCC 


TACTCGCTGA 


CCCTGCAAGG 


GCTGGGCATC 


1500 
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A r;pTr;R p tgg 


GGCTGCGCTC 


ACTGAGGGAA 


CTGGGCAGTG 


GACTGGCCCT 


CATCCACCAT 


1560 


aapapppapp 


TPTGPTTPGT 


GCACACGGTG 


CCCTGGGACC 


AGCTCTTTCG 


GAACCCGCAC 


1620 


L HHVJV, 1 V_ 1 VJU 


TPPACACTGC 


CAACCGGCCA 


GAGGACGAGT 


GTGTGGGCGA 


GGGCCTGGCC 


16B0 


ttt*/** a r*p app 
ioLLA^LAoL 


tgtgpgpppg 


AGGG P APTGP 


TGGGGTCCAG 


GGCCCACCCA 


GTGTGTCAAC 


1740 


I LjLAvjLLALj 1 


1 LL 1 1 LOOOO 


PPAGGAGTGP 


GTGGAGGAAT 


GCCGAGTACT 


GCAGGGGCTC 


1800 


L L LAb boAo 1 


ATPTPA ATPP 


PAPPPAPTPT 


TTGPPGTGPC 


ACCCTGAGTG 


TCAGCCCCAG 


1860 


AATGGCTLAo 




tpp a ppn^a a 


GPTG AP P AGT 

V7L X \Jf\\~ \*.*VJ 1 


GTGTGGCCTG 


TGCCCACTAT 


1920 


a AOf^prrTr 
AAbbAC LLit 




flnPPPfiPTP.P 
OOLLLOL 1 UL 


PPPAGPGGTG 


TGAAACCTGA 


CCTCTCCTAC 


1980 




00>i/\0 1 1 ILv. 


AGATGAGGAG 


GGCGCATGCC 


AGCCTTGCCC 


CATCAACTGC 


2040 


ALLL AL. 1 LL 1 




OUn 1 UnUviU 


UV3L 1VJLLLV.VJ 


C CG AGC AG AG 


AGCCAGCCCT 


2100 


PTP.APPTPPA 


TPGTPTPTGP 


GGTGGTTGGP 

OO X \JVJ X X UUv, 


ATTCTGCTGG 


TCGTGGTCTT 


GGGGGTGGTC 


2160 


TTTfinn. atpp 

1 1 looVjAILL 


TPATPA AGPG 


APGGPAGPAG 


AAGATPPGGA 


AGTACACGAT 


GCGGAGACTG 

1 tm9 \ v > ^— * W<i •» ^Wi* turn ^m* 


2220 


r ,T mr , Ti.nn a a a 

L 1 OLAoOArt/* 


pppapiptppt 


GGAGPPGPTG 


AP A PP T AGCG 


GAGCGATGCC 


CAACCAGGCG 


2280 


P AP. A Tf2 PPfJ A 


TPPTn^AAfiA 
X V— x O/wtnOJ'V 


GAPGGAGPTG 


AGGAAGGTGA 


AGGTGCTTGG 


ATCTGGCGCT 


2340 


ill ovtL ALAO 


TPTAP A Afififi 


PATPTGGATP 

Ln x v* 1 wn x v. 


PPTGATGGGG 


AGAATGTGAA 


AATTCCAGTG 


2400 


r*r*r* a tv a a a r» 

va L LA l\,AAAu 


loll OA000A 


A A APAPATPP 


PPP AAAGPPA 


ACAAAGAAAT 


CTTAGACGAA 


2460 


uLA 1 ALu 1 un 


1 uuL 1 LtO X O 1 


GGGPTPPPPA 

OOOL X V_ V_\_V-rt 


TATGTPTPCC 

X n. X O X X 


GCCTTCTGGG 


CATCTGCCTG 


2520 


A P a TP" H A PPT* 
ALAJ LLALoVj 


1 uL AoL 1 vjLj 1 


PAPAPAPPTT 


ATGPPPTATG 


GPTGPCTCTT 


AGACCATGTC 


2580 


LwtibAAAAL L 


nrr.fiarnpp'p 

uLuuA^vLL 1 


ptpp p a r: 

OOO L XLLLftO 


GAPPTGPTGA 

OAL V» X O V— X \Jn 


APTGGTGTAT 


GCAGATTGCC 


2640 


AAoVjLjnj A 1 uA 


PPTRPPTPP A 
1 ALL 1 uuA 


uun 1 0 1 VjLvjvj 


PTPGTAPAPA 


GGG APTTGG C 


CGCTCGGAAC 


2700 


\j 1 (jL 1 00 1 LA 


Hp RPTPPP7\ A 
AoAO 1 L LLAA 


L LA X 0 1 L AAA 


ATTAPAPIAPT 
All ALxAVjAL 1 


TPGGGPTGGC 


TPGGCTGCTG 


2760 


P7\nj\ ipiip 7\ PP 
bALAl 1 vjALo 


IfiriPR PT A 


P*P* ATPT* An AT 
LL A 1 «LAuA X 




TGPPPATCAA 


GTGGATGGCG 


2820 


L loo AO 1 Uv-ft 


TTPTnPPPPP 


PPPPTTPAPP 
bLuu X X LnLL 


PAPPAGAGTG 


ATGTGTGGAG 


TTATGGTGTG 


2880 


i-vL luiul uuu 


apptpatpap 

XOL 1 wM. 1 0/-VL 


TTTTGGGG PP 

X X X X VJUUULU 


AAAPPTTACG 


ATGGGATCCC 


AGCCCGGGAG 

«*• LJ \> >— . \ J VJX»W 


2 94 0 


ATPPPTGAPP 
/\ A X OrtV» \_ 


TGPTGGAAAA 


GGGGGAGPGG 


CTGCCCCAGC 


CCCCCATCTG 


CACCATTGAT 


3000 


GTCTACATGA 


TCATGGTCAA 


ATGTTGGATG 


ATTGACTCTG 


AATGTCGGCC 


AAGATTCCGG 


3060 


GAGTTGGTGT 


CTGAATTCTC 


CCGCATGGCC 


AGGGACCCCC 


AGCGCTTTGT 


GGTCATCCAG 


3120 


AATGAGGACT 


TGGGCCCAGC 


CAGTCCCTTG 


GACAGCACCT 


TCTACCGCTC 


ACTGCTGGAG 


3180 


GACGATGACA 


TGGGGGACCT 


GGTGGATGCT 


GAGGAGTATC 


TGGTACCCCA 


GCAGGGCTTC 


3240 


TTCTGTCCAG 


ACCCTGCCCC 


GGGCGCTGGG 


GGCATGGTCC 


ACCACAGGCA 


CCGCAGCTCA 


3300 


TCTACCAGGA 


GTGGCGGTGG 


GGACCTGACA 


CTAGGGCTGG 


AGCCCTCTGA 


AGAGGAGGCC 


3360 


CCCAGGTCTC 


CACTGGCACC 


CTCCGAAGGG 


GCTGGCTCCG 


ATGTATTTGA 


TGGTGACCTG 


3420 
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GGAATGGGGG CAGCCAAGGG GCTGCAAAGC CTCCCCACAC ATGACCCCAG CCCTCTACAG 34 8 0 

CGGTACAGTG AGGACCCCAC AGTACCCCTG CCCTCTGAGA CTGATGGCTA CGTTGCCCCC 3 54 0 

CTGACCTGCA GCCCCCAGCC TGAATATGTG AACCAGCCAG ATGTTCGGCC CCAGCCCCCT 3600 

TCGCCCCGAG AGGGCCCTCT GCCTGCTGCC CGACCTGCTG GTGCCACTCT GGAAAGGGCC 366 0 

AAGACTCTCT CCCCAGGGAA GAATGGGGTC GTCAAAGACG TTTTTGCCTT TGGGGGTGCC 372 0 

GTGGAGAACC CCGAGTACTT GACACCCCAG GGAGGAGCTG CCCCTCAGCC CCACCCTCCT 3 78 0 

CCTGCCTTCA GCCCAGCCTT CGACAACCTC TATTACTGGG ACCAGGACCC ACCAGAGCGG 3840 

GGGGCTCCAC CCAGCACCTT CAAAGGGACA CCTACGGCAG AGAACCCAGA GTACCTGGGT 3 900 

CTGGACGTGC CAGTGTGAAC CAGAAGGCCA AGTCCGCAGA AGCCCTGATG TGTCCTCAGG 396 0 

GAGCAGGGAA GGCCTGACTT CTGCTGGCAT CAAGAGGTGG GAGGGCCCTC CGACCACTTC 4 020 

CAGGGGAACC TGCCATGCCA GGAACCTGTC CTAAGGAACC TTCCTTCCTG CTTGAGTTCC 4080 

CAGATGGCTG GAAGGGGTCC AGCCTCGTTG GAAGAGGAAC AGCACTGGGG AGTCTTTGTG 4140 

GATTCTGAGG CCCTGCCCAA TGAGACTCTA GGGTCCAGTG GATGCCACAG CCCAGCTTGG 4200 

CCCTTTCCTT CCAGATCCTG GGTACTGAAA GCCTTAGGGA AGCTGGCCTG AGAGGGGAAG 4260 

CGGCCCTAAG GGAGTGTCTA AGAACAAAAG CGACCCATTC AGAGACTGTC CCTGAAACCT 4320 

AGTACTGCCC CCCATGAGGA AGGAACAGCA ATGGTGTCAG TATCCAGGCT TTGTACAGAG 4380 

TGCTTTTCTG TTTAGTTTTT ACTTTTTTTG TTTTGTTTTT TTAAAGACGA AATAAAGACC 444 0 

CAGGGGAGAA TGGGTGTTGT ATGGGGAGGC AAGTGTGGGG GGTCCTTCTC CACACCCACT 4 5 00 

TTGTCCATTT GCAAATATAT TTTGGAAAAC 4530 
(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 91 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ATGTGCAATA CCAACATGTC TGTACCTACT GATGGTGCTG TAACCACCTC ACAGATTCCA 60 

GCTTCGGAAC AAGAGACCCT GGATCTTGAT GCTGGTGTAA GTGAACATTC AGGTGATTGG 120 

TTGGATCAGG ATTCAGTTTC AGATCAGTTT AG TGTAGAAT TTGAAGTTGA ATCTCTCGAC 180 

TCAGAAGATT ATAGCCTTAG TGAAGAAGGA CAAGAACTCT CAGATGAAGA TGATGAGGTA 24 0 

TATCAAGTTA CTGTGTATCA GGCAGGGGAG AGTGATACAG ATTCATTTGA AGAAGATCCT 3 00 

GAAATTTCCT TAGCTGACTA TTGGAAATGC ACTTCATGCA ATGAAATGAA TCCCCCCCTT 36 0 

CCATCACATT GCAACAGATG TTGGGCCCTT CGTGAGAATT GGCTTCCTGA AGATAAAGGG 42 0 

AAAGATAAAG GGGAAATCTC TGAGAAAGCC AAACTGGAAA ACTCAACACA AGCTGAAGAG 480 
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GGCTTTGATG 


TTCCTGATTG TAAAAAAACT 


ATAGTGAATG 


ATTCCAGAGA 


GTCATGTGTT 


540 


GAGGAAAATG 


ATGATAAAAT TACACAAGCT 


TCACAATCAC 


AAGAAAGTGA AGACTATTCT 


600 


CAGCCATCAA 


CTTCTAGTAG CATTATTTAT 


AGCAGCCAAG 


AAGATGTGAA 


AGAGTTTGAA 


660 


AGGGAAGAAA 


CCCAAGACAA AGAAGAGAGT 


GTGGAATCTA 


GTTTGCCCCT 

• 


TAATGCCATT 


720 


GAACCTTGTG 


TGATTTGTCA AGGTCGACCT 


AAAAATGGTT 


GCATTGTCCA 


TGGCAAAACA 


780 


GGACATCTTA 


TGGCCTGCTT TACATGTGCA 


AAGAAGCTAA 


AGAAAAGGAA 


TAAGCCCTGC 


840 


CCAGTATGTA 


GACAACCAAT TCAAATGATT 


GTGCTAACTT 


ATTTCCCCTA 


G 


891 


(2) INFORMATION FOR SEQ ID NO: 6: 











<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 657 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
ATG TGCAATA CCAACATGTC TGTACCTACT GATGGTGCTG TAACCACCTC ACAGATTCCA 
GCTTCGGAAC AAGAGACCCT GGACTATTGG AAATGCACTT CATGCAATGA AATGAATCCC 
CCCCTTCCAT CACATTGCAA CAGATGTTGG GCCCTTCGTG AGAATTGGCT TCCTGAAGAT 
AAAGGGAAAG ATAAAGGGGA AATCTCTGAG AAAGCCAAAC TGGAAAACTC AACACAAGCT 
GAAGAGGGCT TTGATGTTC C TGATTGTAAA AAAACTATAG TGAATGATTC CAGAGAGTCA 
TGTGTTGAGG AAAATGATGA TAAAATTACA CAAGCTTCAC AATCACAAGA AAGTGAAGAC 
TATTCTCAGC CATCAACTTC TAGTAGCATT ATTTATAGCA GCCAAGAAGA TGTGAAAGAG 
TTTGAAAGGG AAGAAACCCA AGACAAAGAA GAGAGTGTGG AATCTAGTTT GCCCCTTAAT 
GCCATTGAAC CTTGTGTGAT TTGTCAAGGT CGACCTAAAA ATGGTTGCAT TGTCCATGGC 
AAAACAGGAC ATCTTATGGC CTGCTTTACA TGTGCAAAGA AGCTAAAGAA AAGGAATAAG 
CCCTGCCCAG TATGTAGACA ACCAATTCAA ATGATTGTGC TAACTTATTT CCCCTAG 
(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 966 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
ATGTGCAATA CCAACATGTC TGTACCTACT GATGGTGCTG TAACCACCTC ACAGATTCCA 
GCTTCGGAAC AAGAGACCCT GGTTAGACCA AAGCCATTGC TTTTGAAGTT ATTAAAGTCT 
GTTGGTGCAC AAAAAGACAC TTATACTATG AAAGAGGATC TTGATGCTGG TGTAAGTGAA 
CATTCAGGTG ATTGGTTGGA TCAGGATTCA GTTTCAGATC AGTTTAGTGT AGAATTTGAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
657 



60 
120 
180 
240 



WO 97/25860 



- 60 - 



PCT/US97/00582 



GTTGAATCTC 


TCGACTCAGA 


AG ATT AT AG C 


CTTAGTGAAG 


AAGGACAAGA 


ACTCTCAGAT 


300 


GAAGATGATG 


AGGTATATCA 


AGTTACTGTG 


TATCAGGCAG 


GGGAGAGTGA 


TACAGATTCA 


360 


TTTGAAGAAG 


ATCCTGAAAT 


TTCCTTAGCT 


GACTATTGGA 


AATGCACTTC 


ATGCAATGAA 


420 


ATGAATCCCC 


CCCTTCCATC 


ACATTGCAAC 


AGATGTTGGG 


CCCTTCGTGA 


GAATTGGCTT 


480 


CCTGAAGATA 


AAGGGAAAGA 


TAAAGGGGAA 


ATCTCTGAGA 


AAGCCAAACT 


GGAAAACTCA 


540 


ACACAAGCTG 


AAGAGGGCTT 


TGATGTTCCT 


GATTGTAAAA 


AAACTATAGT 


GAATGATTCC 


600 


AGAGAGTCAT 


GTGTTGAGGA 


AAATGATGAT 


AAAATTACAC 


AAGCTTCACA 


ATCACAAGAA 


660 


AGTGAAGACT 


ATTCTCAGCC 


ATCAACTTCT 


AGTAGCATTA 


TTTATAGCAG 


CCAAGAAGAT 


720 


GTGAAAGAGT 


TTGAAAGGGA 


AGAAACCCAA 


GACAAAGAAG 


AGAGTGTGGA 


ATCTAGTTTG 


780 


CCCCTTAATG 


CCATTGAACC 


TTGTGTGATT 


TGTCAAGGTC 


GACCTAAAAA 


TGGTTGCATT 


840 


GTCCATGGCA 


AAACAGGACA 


TCTTATGGCC 


TGCTTTACAT 


GTGCAAAGAA 


GCTAAAGAAA 


900 


AGGAATAAGC 


CCTGCCCAGT 


ATGTAGACAA 


CCAATTCAAA 


TGATTGTGCT 


AACTTATTTC 


960 


CCCTAG 












966 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 399 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

ATGTGCAATA CCAACATGTC TGTACCTACT GATGGTGCTG TAACCACCTC ACAGATTCCA 60 

GCTTCGGAAC AAGAGACCCT GGTTAGACAA GAAAGTGAAG ACTATTCTCA GCCATCAACT 120 

TCTAGTAGCA TTATTTATAG CAGCCAAGAA GATGTGAAAG AGTTTGAAAG GGAAGAAACC 180 

CAAGACAAAG AAGAGAGTGT GGAATCTAGT TTGCCCCTTA ATGCCATTGA ACCTTGTGTG 240 

ATTTGTCAAG GTCGACCTAA AAATGGTTGC ATTGTCCATG GCAAAACAGG ACATCTTATG 3 00 

GCCTGCTTTA CATGTGCAAA GAAGCTAAAG AAAAGGAATA AGCCCTGCCC AGTATGTAGA 360 

CAAC CAATTC AAATGATTGT G CTAACTT AT TTCCCCTAG 399 
(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 309 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
ATGTGCAATA CCAACATGTC TGTACCTACT GATGGTGCTG TAACCACCTC ACAGATTCCA 60 
GCTTCGGAAC AAGAGACCCT GGTTAGACCA AAGCCATTGC TTTTGAAGTT ATTAAAGTCT 120 
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GTTGGTGCAC AAAAAGACAC TTATACTATG AAAGAGGTTC TTTTTTATCT TGGCCAGTAT 180 

ATTATGACTA AACGATTATA TGATGAGAAG CAACAACATA TTGTAAATGA TTGTGCTAAC 24 0 

TTATTTCCCC TAGTTGACCT GTCTATAAGA GAATTATATA TTTCTAACTA TATAACCCTA 300 

GGAATTTAG 309 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1897 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



CACAGATAAG 


GTTATTTGGG 




AflfiA CTT A a TV 


l_ CvjVjAu A ILCj 


CCCAAAAGGA 


60 


TGAGGTGACT 


AAGAAAGATG 


una rviA ncc c 


1L1 1 1 1 ivjv».A 


MWI TV ^ty-i f"» 

uuL lvjt_aAL»(jv_ 


AC A a AG AT AA 


120 


GAGAATTATC 


ACTCTACATT 


c* a Tf r PT , *r , r* r r r* 

1C1 1 1 \~ X v». 


A A A fl ZVfT TATA T 


l» 1 AC 1 luiul 


vj 1 X 1 TATATT 


180 


TCATTAGAAT 


CGGACAGATG 






LAb AAAvj 1 A I 


T AAACC CAGA 


240 


ACTTAACAAA 


GGTCCATGGA 


pta a ^.n a nn a 

V- x AAAvj>ivjvjM 


ij<jA 1 CAAAvjO 


/"""FTV TAT^TA/"' TV An 


TV f~+F^^Pf> TV TV TV 

ACGTGCAGAA 


300 


ATACGGTCCA AAGCGCTGGT 




• i » TV TV f**/"* TV TTTP 

1 AAov-Al 1 lb 


tv Arir , r , 7A apt 7\ 
AAVjtjVjAAvjvjA 


rr»rriy-i/^ TV TV TV TV <"» 1\ 

1 rCaGAAAACA 


360 


GTGCAGGGAG 


AGGTGGCACA 




tpp tv ta Ti^ ,r ^^ , 

1 UUAvjAAvj 1 Ij 


TV A TA TV TV TV r" 1 r^T 

AAtaAAAAv- L I 


CCTGGACAGA 


420 


AGAGGAAGAT 


AGAATTATTT 


ACCAGGCACA 


CAAGAGACTG 


GGAAACAGAT 


GGGCAGAAAT 


480 


TGCAAAGTTG 


CTGCCTGGAC 


GGACTGATAA 


CGCTGTCAAG 


AACCACTGGA 


ATTCCACCAT 


540 


GCGCCGGAAG 


GTCGAGCAGG 


AGGGTTACCC 


GCAGGAGTCC 


TCCAAAGCCG 


GCCCGCCCTC 


600 


GGCAACCACC 


GGCTTCCAGA 


AGAGCAGCCA 


TCTGATGGCC 


TTTGCCCACA 


ACCCACCTGC 


660 


AGGCCCGCTC 


CCGGGGGCCG 


GCCAGGCCCC 


TCTGGGCAGT 


GACTACCCCT 


ACTACCACAT 


720 


TGCTGAGCCA 


CAAAATGTCC 


CTGGTCAGAT 


CCCATATCCA 


GTAGCACTGC 


ATATAAATAT 


780 


TATCAATGTT 


CCTCAGCCAG 


CTGCTGCAGC 


TATTCAGAGA 


CACTATACTG 


ATGAAGACCC 


840 


TGAGAAAGAA 


AAACGAATAA 


AGGAATTAGA 


GTTGCTACTT 


ATGTCGACTG 


AGAATGAACT 


900 


GAAAGGGCAG 


CAGGCATTAC 


CAACACAGAA 


CCACACAGCA 


AACTACCCCG 


GCTGGCACAG 


960 


CACCACGGTT 


GCTGACAATA 


CCAGGACCAG 


TGGTGACAAT 


GCGCCTGTTT 


CCTGTTTGGG 


1020 


GGAACATCAC 


CACTGTACTC 


CATCTCCACC 


AGTGGATCAT 


GGTTGCTTAC 


CTGAGGAAAG 


1080 


TGCGTCCCCC 


GCACGGTGCA 


TGATTGTTCA 


CCAGAGCAAC 


ATCCTGGATA 


ATGTTAAGAA 


1140 


TCTCTTAGAA 


TTTGCAGAAA 


CACTCCAGTT 


AATAGACTCC 


TTCTTAAACA 


CATCGTCCAA 


1200 


TCACGAGAAT 


CTGAACCTGG 


ACAACCCTGC 


ACTAACCTCC 


ACGCCAGTGT 


GTGGCCACAA 


1260 


GATGTCTGTT 


ACCACCCCAT 


TCCACAAGGA 


CCAGACTTTC 


ACTGAATACA 


GGAAGATGCA 


1320 


CGGCGGAGCA 


GTCTAGAGCT 


CAATTATAAT 


AATCTTGCGA 


ATCGGGCTGT 


AACGGGGCAA 


1380 
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GGCTTGACCG AGGGGACTAT AACATGTATA GGCGAAAAGC GGGGTCTCGG TTGTAACGCG 144 0 

CTTAGGAAGT CCCCTCGAGG TATGGCAGAT ATGCTTTTGC ATAGGGAGGG GGAAATGTAG 1500 

TCTTAATCGT AGGTTAACAT GTATATTACC AAATAAGGGA ATCGCCTGAT GCACCAAATA 156 0 

AGGTATTATA TGATCCCATT GGTGGTGAAG GAGCGACCTG AGGGCATATG GGCGTTAACA 162 0 

GAACTGTCTG TCCTTGCGTC ATTCCTCATC GGATCATGTA CGCGGCAGAG TATGATTGGA 16 80 

TAACAGGATG GCACCATTCA TCGTGGCGCA TGCTGATTGG TGCGACTAAG GAGTTGTGTA 174 0 

ACCCACGAAT GTACTTAAGC TTGTAGTTGC TAACAATAAA GTGCCATTCT ACCTCTCACC 180 0 

ACATTGGTGT GCACCTGGGT TGATGGCCGG ACCGTCGATT CCCTGACGAC TGCGAACACC 186 0 

TGAATGAAGC TGAAGGCTTC AGGTACCCTT ACTTGAT 1897 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8082 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 



AGCTTGTTTG 


GCCGTTTTAG 


GGTTTGTTGG 


AATTTTTTTT 


TCGTCTATGT 


ACTTGTGAAT 


60 


TATTTCACGT 


TTGCCATTAC 


CGGTTCTCCA 


TAGGGTGATG 


TTCATTAGCA 


GTGGTGATAG 


120 


GTTAATTTTC 


ACCATCTCTT 


ATGCGGTTGA 


ATAGTCACCT 


CTGAACCACT 


TTTTCCTCCA 


180 


GTAACTCCTC 


TTTCTTCGGA 


CCTTCTGCAG 


CCAACCTGAA 


AGAATAACAA 


GGAGGTGGCT 


240 


GGAAACTTGT 


TTTAAGGAAC 


CGCCTGTCCT 


TCCCCCGCTG 


GAAACCTTGC 


ACCTCGGACG 


300 


CTCCTGCTCC 


TGCCCCCACC 


TGACCCCCGC 


CCTCGTTGAC 


ATCCAGGCGC 


GATGATCTCT 


360 


GCTGCCAGTA 


GAGGGCACAC 


TTACTTTACT 


TTCGCAAACC 


TGAACGCGGG 


TGCTGCCCAG 


420 


AGAGGGGGCG 


GAGGGAAAGA 


CGCTTTGCAG 


CAAAATCCAG 


CATAGCGATT 


GGTTGCTCCC 


480 


CGCGTTTGCG 


GCAAAGGCCT 


GGAGGCAGGA 


GTAATTTGCA 


ATCCTTAAAG 


CTGAATTGTG 


540 


CAGTGCATCG 


GATTTGGAAG 


CTACTATATT 


CACTTAACAC 


TTGAACGCTG 


AGCTGCAAAC 


600 


TCAACGGGTA 


ATAACCCATC 


TTGAACAGCG 


TACATGCTAT 


ACACACACCC 


CTTTCCCCCG 


660 


AATTGTTTTC 


TCTTTTGGAG 


GTGGTGGAGG 


GAGAGAAAAG 


TTTACTTAAA 


ATGCCTTTGG 


720 


GTGAGGGACC 


AAGGATGAGA 


AGAATGTTTT 


TTGTTTTTCA 


TGCCGTGGAA 


TAACACAAAA 


780 


TAAAAAATCC 


CGAGGGAATA 


TACATTATAT 


ATTAAATATA 


GATCATTTCA 


GGGAGCAAAC 


840 


AAATCATGTG 


TGGGGCTGGG 


CAACTAGCTG 


AGTCGAAGCG 


TAAATAAAAT 


GTGAATACAC 


900 


GTTTGCGGGT 


TACATACAGT 


GCACTTTCAC 


TAGTATTCAG 


AAAAAATTGT 


GAGTCAGTGA 


960 


ACTAGGAAAT 


TAATGCCTGG 


AAGGCAGCCA 


AATTTTAATT 


AGCTCAAGAC 


TCCCCCCCCC 


1020 


CCCCAAAAAA 


AGGCACGGAA 


GTAATACTCC 


TCTCCTCTTC 


TTTGATCAGA 


ATCGATGCAT 


1080 
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1111 lUi O^-rt 


TP 21 PPP r* 7A TT 
I UnLLbLA X 1 


T k / m */^ TV TV *P A A T» A 

i l laa r aat A 


AAA f*r*f*f* TV TV 7\ 

aaaggggaaa 


GAGGACCTGG 


A A 7\ f~*r* 7l TV fprn* 

AAAGGAATTA 


1140 


i-LHLLa ILL LtLj J. 


1 Ibl LLvjLaLiLj 


AGGAAAGAGT 


TAACGGTTTT 


TTTCACAAGG 


GTCTCTGCTG 


1200 


rtL 1 lvLL.LV.UO 


PTPPPTPP 7A P 
L 1 LV3U 1 LLAL 


AAGl 1 L I LlA 


lTTGllllTT 


TTAGGAAGTC 


CGGTCCCGCG 


1260 


Lr 1 1 Luoo lAL 


LLLL 1 OLLLL 


Tll CATATTl 


TCCCGTCTAG 


CACCTTTGAT 


TTCTCCCAAA 


1320 


LLLuuLnvjLL 


L L» AG AC TGTT 


GCaaACCGGC 


GCCACAGGGC 


GCAAAGGGGA 


TTTGTCTCTT 


1380 


/*»TP 7A TV A <"*^"TV^ 

L ibAAALL lu 


bLl bAvjAAAT 


TGGGAACTCC 


GTGTGGGAGG 


CGTGGGGGTG 


GGACGGTGGG 


1440 


fl T 7A C 7A fS 7A HTP 


G LAG AG AG LA 


GGlAACCTCC 


CTCTCGCCCT 


AGCCC71GCTC 


TGGAACAGGC 


1500 


7A f 7A TV f» A TP T* 
AbALALA 1 L 1 


CAGGGCTAAA 


cagacgcctc 


CCGCACGGGG 


CCCCACGGAA GCCTGAGCAG 


1560 


VjLGoVjuLAuo 


AGGGGCGGTA 


tctgctgctt 


TGGCAGCAAA 


TTGGGGGACT 


CAGTCTGGGT 


1620 


GGAAGGTATC 


CAATCCAGAT 


agctgtgcat 


ACATAATGCA 


TAATACATGA 


CTCCCCCCAA 


1680 


PAR TATV^OA TAT* 


GGGAGTTTAT 


TCATAACGCG 


CTCTCCAAGT 


ATACGTGGCA 


ATGCGTTGCT 


1740 


VjLAjI 1A1 111 


AATCATTCTA 


GGCATCGTTT 


TCCTCCTTAT 


GCCTCTATCA 


TTCCTCCCTA 


1800 


TfTTA fTA HTTA 7A 
ILl ALAL 1 AA 


CATCCCACGC 


TCTGAACGCG 


CGCCCATTAA 


TACCCTTCTT 


TCCTCCACTC 


1860 


A LLL IuuVjAL 


TCTTGATCAA AGCGCGGCCC 


TTTCCCCAGC 


CTTAGCGAGG 


CGCCCTGCAG 


1920 


LL luulALbL 


GCGTGGCGTG 


GCGGTGGGCG 


CGCAGTGCGT 


T CT CTGTGTG 


GAGGGCAGCT 


1980 


L»l ILLbLLiu 


CGATGATTTA 


TACTCACAGG 


ACAAGGATGC 


GGTTTGTCAA 


ACAGTACTGC 


2040 


T & PP 1*2 TV TA /"2 
1 aL V3 UlAubAu 


CAGCAGAGAA 


AGGGAGAGGG 


TTTGAGAGGG 


AGCAAAAGAA 


AATGGTAGGC 


2100 


uLuLbiAul 1 


AATTCATGCG 


GCTCTCTTAC 


TCTGTTTACA 


TCCTAGAGCT 


AGAGTGCTCG 


2160 


LjL IuLLLuuL 


TGAGTCTCCT 


CCCCACCTTC 


CCCACCCTCC 


CCACCCTCCC 


CATAAGCGCC 


2220 


LL 4. LLLvjvjLj 1 


TCCCAAAGCA 


GAGGGCGTGG 


GGGAAAAGAA 


AAAAGATCCT 


CTCTCGCTAA 


2280 


TPTPrPPPr TV 
1 L 1 LLuLLLA 


CCGGCCCTTT 


ATAATGCGAG 


GGTCTGGACG 


GCTGAGGACC 


CCCGAGCTGT 


2340 


LiL 1 LjL 1 LGlCj 


GCCGCCACCG 


CCGGGCCCCG 


GCCGTCCCTG 


GCTCCCCTCC 


TGCCTCGAGA 


2400 


7a r^Ciczr* 7a czr^cr* 

M.VavjoLAv3VaVjL 


TTCTCAGAGG 


CTTGGCGGGA 


AAAAGAACGG 


AGGGAGGGAT 


CGCGCTGAGT 


2460 


7AT7A 7A 7A 7A rrrTT* 


GTTTTCGGGG 


CTTTATCTAA 


CTCGCTGTAG 


TAATTCCAGC 


GAG AGG CAGA 


2520 


nf^OTAfSPfSTAnP 

VJv3\7/lv3Lv3>iL7L 


GGGCGGCCGG 


CTAGGGTGGA 


AG AGC CGGG C 


GAGCAGAGCT 


GCGCTGCGGG 


2580 


CGTCCTGGGA 


AGGGAGATCC 


GGAGCGAATA 


GGGGGCTTCG 


CCTCTGGCCC 


AGCCCTCCCG 


2640 


CTGATCCCCC 


AGCCAGCGGT 


CCGCAACCCT 


TGCCGCATCC 


ACGAAACTTT 


GCCCATAGCA 


2700 


GCGGGCGGGC 


ACTTTGCACT 


GGAACTTACA 


ACACCCGAGC 


AAGGACGCGA 


CTCTCCCGAC 


2760 


GCGGGGAGGC 


TATTCTGCCC 


ATTTGGGGAC 


ACTTCCCCGC 


CGCTGCCAGG 


ACCCGCTTCT 


2820 


CTGAAAGGCT 


CTCCTTGCAG 


CTGCTTAGAC 


GCTGGATTTT 


TTTCGGGTAG 


TGGAAAACCA 


2880 


GGTAAGCACC 


GAAGTCCACT 


TGCCTTTTAA 


TTTATTTTTT 


TATCACTTTA 


ATGCTGAGAT 


2940 


GAGTCGAATG 


CCTAAATAGG 


GTGTCTTTTC 


TCCCATTCCT 


GCGCTATTGA 


CACTTTTCTC 


3000 
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AGAGTAGTTA 


TGGTAACTGG 


GGCTGGGGTG 


GGGGGTAATC 


CAGAACTGGA 


TCGGGGTAAA 


3060 


GTGACTTGTC 


AAGATGGGAG 


AGGAGAAGGC 


AGAGGGAAAA 


CGGGAATGGT 


TTTTAAGACT 


3120 


ACCCTTTCGA 


GATTTCTGCC 


TTATGAATAT 


ATTCACGCTG 


ACTCCCGGCC 


GGTCGGACAT 


3180 


TCCTGCTTTA 


TTGTGTTAAT 


TGCTCTCTGG 


GTTTTGGGGG 


GCTGGGGGTT 


GCTTTGCGGT 


3240 


GGGCAGAAAG 


CCCCTTGCAT 


CCTGAGCTCC 


TTGGAGTAGG 


GACCGCATAT 


CGCCTGTGTG 


3300 


AGCCAGATCG 


CTCCGCAGCC 


GCTGACTTGT 


CCCCGTCTCC 


GGGAGGGCAT 


TTAAATTTCG 


3360 


GCTCACCGCA 


TTTCTGACAG 


CCGGAGACGG 


ACACTGCGGC 


GCGTCCCGCC 


CGCCTGTCCC 


3420 


CGCGGCGATT 


CCAACCCGCC 


CTGATCCTTT 


TAAGAAGTTG 


GCATTTGGCT 


TTTTAAAAAG 


3480 


CAATAATACA 


ATTTAAAACC 


TGGGTCTCTA 


GAGGTGTTAG 


GACGTGGTGT 


TGGGTAGGCG 


3540 


CAGGCAGGGG 


AAAAGGGAGG 


CGAGGATGTG 


TCCGATTCTC 


CTGGAATCGT 


TGACTTGGAA 


3600 


AAACCAGGGC 


GAATCTCCGC 


ACCCAGCCCT 


GACTCCCCTG 


CCGCGGCCGC 


CCTCGGGTGT 


3660 


CCTCGCGCCC 


GAGATGCGGA 


GGAACTGCGA 


GGAGCGGGGC 


TCTGGGCGGT 


TCCAGAACAG 


3720 


CTGCTACCCT 


TGGTGGGGTG 


GCTCCGGGGG 


AGGTATCGCA 


GCGGGGTCTC 


TGGCGCAGTT 


3780 


GCATCTCCGT 


ATTGAGTGCG 


AAGGGAGGTG 


CCCCTATTAT 


TATTTGACAC 


CCCCCTTGTA 


3840 


TTTATGGAGG 


GGTGTTAAAG 


CCCGCGGCTG 


AGCTCGCCAC 


TCCAGCCGGC 


GAGAGAAAGA 


3900 


AGAAAAGCTG 


GCAAAAGGAG 


TGTTGGACGG 


GGGCGGTACT 


GGGGGTGGGG 


ACGGGGGCGG 


3960 


TGGAGAGGGA 


AGGTTGGGAG 


GGGCTGCGGT 


GCCGGCGGGG 


GTAGGAGAGC 


GGCTAGGGCG 


4020 


CGAGTGGGAA 


CAGCCGCAGC 


GGAGGGGCCC 


CGGCGCGGAG 


CGGGGTTCAC 


GCAGCCGCTA 


4080 


GCGCCCAGGC 


GCCTCTCGCC 


TTCTCCTTCA 


GGTGGCGCAA 


AACTTTGTGC 


CTTGGATTTT 


4140 


GGCAAATTGT 


TTTCCTCACC 


GCCACCTCCC 


GCGGCTTCTT 


AAGGGCGCCA 


GGGCCGATTT 


4200 


CGATTCCTCT 


GCCGCTGCGG 


GGCCGACTCC 


CGGGCTTTGC 


GCTCCGGGCT 


CCCGGGGGAG 


4260 


CGGGGGCTCG 


GCGGGCACCA 


AGCCGCTGGT 


TCACTAAGTG 


CGTCTCCGAG 


ATAGCAGGGG 


4320 


ACTGTCCAAA 


GGGGGTGAAA 


GGGTGCTCCC 


TTTATTCCCC 


CACCAAGACC 


ACCCAGCCGC 


4380 


TTTAGGGGAT 


AGCTCTGCAA 


GGGGAGAGGT 


TCGGGACTGT 


GGCGCGCACT 


GCGCGCTGCG 


4440 


CCAGGTTTCC 


GCAC CAAGAC 


CCCTTTAACT 


CAAGACTGCC 


TCCCGCTTTG 


TGTGCCCCGC 


4500 


TCCAGCAGCC 


TCCCGCGACG 


ATGCCCCTCA 


ACGTTAGCTT 


CACCAACAGG 


AACTATGACC 


4560 


TCGACTACGA 


CTCGGTGCAG 


CCGTATTTCT 


ACTGCGACGA 


GGAGGAGAAC 


TTCTACCAGC 


4620 


AGCAGCAGCA 


GAGCGAGCTG 


CAGCCCCCGG 


CGCCCAGCGA 


GGATATCTGG 


AAGAAATTCG 


4680 


AGCTGCTGCC 


CACCCCGCCC 


CTGTCCCCTA 


GCCGCCGCTC 


CGGGCTCTGC 


TCGCCCTCCT 


4740 


ACGTTGCGGT 


CACACCCTTC 


TCCCTTCGGG 


GAGACAACGA 


CGGCGGTGGC 


GGGAGCTTCT 


4800 


CCACGGCCGA 


CCAGCTGGAG 


ATGGTGACCG 


AGCTGCTGGG 


AGGAGACATG 


GTGAACCAGA 


4860 


GTTTCATCTG 


CGACCCGGAC 


GACGAGACCT 


TCATCAAAAA 


CATCATCATC 


CAGGACTGTA 


4920 
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TGTGGAGCGG 


f*^ 1 t " i ' r " { 

Li ILi LuuLL 




1 v»uf 1L1 V_/*\j>v 


Orti-VVJV* i. OVJv_ v_ 


TCCTACCAGG 


4980 


CTGCGCGCAA 


AG AL AG L GGL, 


AGCCCGAAH- 


LLv3V,LLuv.ub 






S040 


CC AGCT TGT A 


C CTGCAGGAT 


CTG AGCGC CCa 




vj I vjV-M. i. Cvx/kv_ 




no 

J X \J \J 


TCTTCCCCTA 


CCLT L 1 CAAC 


GACAGCAGL I 


CVvL. L. u AAij i U 




LArtbnL X LLA 


«r 1 fin 


^» A9 ah ^ht79h ahj fi A9 

GCGCCTTCTC 


TCCGTCCTCG 


GATTCTCTGC 




ouALi 1 l~v~ ILL 


L LLsL ALtvsVjL A 


D Z. £ \J 


a^ A9) a^ a^ A9 9k ah ah ah 

GCCCCGAGCC 


CCTGGTGCTC 


/-l TV fTT^TTA TV T\ 

CATGAGGAGA 


CACCGCCCAu 


LAL L Avj L Alj L, 


0 AL 1 L 1 Lavs 1 A 


p n 


9k AHAHAHHl 9k AHAHA9A^ 

AGCGAAGCCC 


GCCCAGGCCT 


/*»TV^"»TV TV TV TV^TT*^ 

GTCAAAAGTG 




1ALL1 X ILLL 


A 1 1 1 1 LA i 1 La 




GCAGCTTATT 


TAACGGGCCA 


CTCTTATTAG 


GAAGGAGAGA 


1 ALaL AbA 1 H 


GGAGAGA1 J. 1 


33 U U 


^» m jM ^ft#vk ft #m 

GGGAGCTCAT 


^"1 TV ^l/Tfl/'Mn/i « TV 

CACCTCTGAA 


ACCTTGGGCT 


TTAGCGTTTC 


CTCC C ATCC C 


TTClL lTTAG 


t £T r> 


9k A^m A** AH A^ AH Ik Ift^ 

ACTGCCCATG 


TTTGCAGCCC 


CCCTCCCCGT 


fTTfTT/*Tni/^»IV/^^/*TTV 

TTG TCTC C C A 


CCCCTCAGGA 


A 1 1 1 LA J. 1 1 A 




GGTTTTTAAA 


CCTTCTGGCT 


9k f^H^H^nf9rt9k ^( TV T\ 

T ATCT TAC AA 


/vmrtll TAWTO/"*>\ 

CTCAATCCAC 


ULilLi 1AL 


LILLLLjI iaa 


ccflrt 
DjOU 


CAT TTT AATT 


GCCCTGGGGC 


GGGGTGGCAG 


GGAGTGTAI (j 


AA 1 LjALjoA i A 


t,pt,p7\ nr* a tt 

Avj AoALaLaA 1 1 


J04U 


GATC TC TGAG 


TV /"l T»/T TV TV fTV/T TV TV 

AGTGAATGAA 


^^^f^^^^t ^^^^T9f^T9^^^ 

TTGCTTCCCT 


CTTAACTTLL 


tv /r tv tv rjrppr"P 
L7AL5AA0 X Vjvj I 


LaLaLaAl 1 i AA 1 


D / U Lr 


^•i ft ft fv« ft ft 

GAACTATCTA 


AH 9k 91 9k 9k 91 CY9AH 9k AH 

CAAAAATGAG 


GGGCTGTGTT 


rn tv TV /rnrnnn /~t 

TAGAGGCTAG 


GlAGGGLL I G 


LL I GAG 1 GLLa 


D / O U 


GG AG CCAGTG 


Tv TV PTIO^^'wiip TV 

AACTGCCTCA 


tv t\ rim/T/T/rmri 

AGAGTGGGTG 


GGCTGAQsGAka 


L i GvjLjAI Lll 


L 1 LALjLL 1 A 1 




TTTGAACACT 


OTt A T> Tl/T TV TV TV 

GAAAAGCAAA 


TCCTTGCCAA 


^k AH '^^'^9^*' AH 9k ^H ^^9AW 

AGTTGGA C TT 


f9Hf^Wi^P^AAT?AkH "HH»fH^^^ AH 


1 1 1A1 ILL 1 i 


CQQfl 
DOOU 


CCCCCGCCCT 


CTTGGACTTT 


fTT/ 1 ! ^"1 T\ TV TV TV ^rrv 

TGGCAAAACT 


AH AH ^\ 9k ^^^A^9A^9 f^H A^9t7^H 

G CAATTTTTT 


f9HA^?*9HiHHi^pfH^r9H9k IHHfH^ 

Till li IA1 I 


111 LAI 1 1 LL 


c q a n 


ft A m-i k ft % « m * ^-m 

AGTAAAATAG 


fm ft F99f9H AH AHAf9 9k 

GGAGTTGCTA 


ft ft ^9j#n ^9j 9k rn ft ah ah 

AAGTCATACC 


ft ft AH A9 9k 9\ f9HAOAH 

AAGCAATTTG 


tv TV m^TV 

CAG CTATCAT 


TTG LAALAL L 


buuu 


TGAAGTGTTC 


#99 inn ah m ft ft ft r99 

TTGGTAAAGT 


A4A( AtfflAt9. ft ft ft ft 

CCCTCAAAAA 


T AGG AGG TG C 


TTGGGAATG T 


AH A"9 f9^f9^^Tl^i A9 ArT9f9HAP* 

GL 111 GL 111 


c 0 c n 
bubU 


GGGTGTGTCC 


9k 9k 9k ah ah ah rr9 ah 94 m 

AAAGCCTCAT 


«¥l 9| ft AHfT9 AHfT9JCT9 9k AH 

TAAGTCTTAG 


AH fn 9k 9k AH 9k 9k M lAfHAH 

GTAAGAATTG 


GCATCAATGT 


ClTATLL 1 GG 




ah 9k 9k ah mnt^ /~* w a> 

GAAGTTGCAC 


TTTTCTTGTC 


A9> 9k ^T9AH AH AH 94 fTl 94 ft 

CATGC CAT AA 


AH AH AH 9k A«AifflAim/9 

CCCAGCTGTC 


fH9J^r9fH9 AH A^ AH ^1*^^^^^^ ^k 

TTTC C C TTT A 


TV A A Pi»pptTT 

TGAGAL I LI 1 


6 Id U 


9k «9mm/* 9k mrt/*i 

ACCTTCATGG 


TGAGAGGAGT 


ft ft ^ft A^B#n^^ A^ A^IPVV 

AAGGGTGGCT 


iTi/i Aimik ah 9k mmn 

GG CTAG ATTG 


GTTCTTTTTT 


TTTTTTTTTC 


^ ^ >i rv 
624 0 


CTTTT TT AAG 


t\ /"» TV m/' ifTT/*i 

ACGGAGTCTC 


ACTCTGTCAC 


T AGG CTGG AG 


I bLALi L bbLb 


/-I tv A TT* A JIPPT 

LAAI LAALL 1 


c "j n n 


CCAACCCCCT 


GGTTCAAGAG 


ft AH^lH AHAHfV9AH AH 

ATTCTCCTGC 


AHfT9AH9k AH AH / 91 19 A9 AH 

CTCAGCCTCC 


C AAG TAG CTG 


GG AC TAC AGG 


"3 O 

0 JbU 


FTV»^"»TV /"(TV, /^/^ TV 

TG CAC ACCAC 


^9 9k fjys^ <*9 9k AH ^9 ^9 

CATGCCAGGC 


TAATTTTTGT 


AATTTTAGTA 


GAG AT. GGGG I 


I^H^IH^H 9k l^HAH^HlHH^H|9H 

1 ILAi LLalGl 


c ii 0 n 


TGGCCAGGAT 


GGTCTCTCCT 


GACCTCACGA 


TCCGCCCACC 


TCGGCCTCCC 


AAAGTGCTGG 


6480 


GATTACAGGT 


GTGAGCCAGG 


GCACCAGGCT 


TAGATGTGGC 


TCTTTGGGGA 


GATAATTTTG 


6540 


TCCAGAGACC 


TTTCTAACGT 


ATTCATGCCT 


TGTATTTGTA 


CAGCATTAAT 


CTGGTAATTG 


6600 


ATTATTTTAA 


TGTAACCTTG 


CTAAAGGAGT 


GATTTCTATT 


TCCTTTCTTA 


AAGAGGAGGA 


6660 


ACAAGAAGAT 


GAGGAAGAAA 


TCGATGTTGT 


TTCTGTGGAA 


AAGAGGCAGG 


CTCCTGGCAA 


6720 


AAGGTCAGAG 


TCTGGATCAC 


CTTCTGCTGG 


AGGCCACAGC 


AAACCTCCTC 


ACAGCCCACT 


6780 


GGTCCTCAAG 


AGGTGCCACG 


TCTCCACACA 


TCAGCACAAC 


TACGCAGCGC 


CTCCCTCCAC 


6840 
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TCGGAAGGAC 


TATCCTGCTG 


CCAAGAGGGT 


CAAGTTGGAC 


AGTGTCAGAG 


TCCTGAGACA 


6900 


GATCAGCAAC 


AACCGAAAAT 


GCACCAGCCC 


CAGGTCCTCG 


GACACCGAGG 


AGAATGTCAA 


6960 


GAGGCGAACA 


CACAACGTCT 


TGGAGCGCCA 


GAGGAGGAAC 


GAGCTAAAAC 


GGAGCTTTTT 


7020 


TGCCCTGCGT 


GACCAGATCC 


CGGAGTTGGA 


AAACAATGAA 


AAGGCCCCCA 


AGGTAGTTAT 


7080 


CCTTAAAAAA 


GCCACAGCAT 


ACATCCTGTC 


CGTCCAAGCA 


GAGGAGCAAA AGCTCATTTC 


7140 


TGAAGAGGAC 


TTGTTGCGGA 


AACGACGAGA 


ACAGTTGAAA 


CACAAACTTG 


AACAGCTACG 


7200 


GAACTCTTGT 


GCGTAAGGAA 


AAGTAAGGAA 


AACGATTCCT 


TCTAACAGAA 


ATGTCCTGAG 


7260 


CAATCACCTA 


TGAACTTGTT 


TCAAATGCAT 


GATCAAATGC 


AACCTCACAA 


CCTTGGCTGA 


7320 


GTCTTGAGAC 


TGAAAGATTT 


AGCCATAATG 


TAAACTGCCT 


CAAATTGGAC 


TTTGGGCATA 


7380 


AAAGAACTTT 


TTTATGCTTA 


CCATCTTTTT 


TTTTTCTTTA 


ACAGATTTGT 


ATTTAAGAAT 


7440 


TGTTTTTAAA 


AAATTTTAAG 


ATTTACACAA 


TGTTTCTCTG 


TAAATATTGC 


CATTAAATGT 


7500 


AAATAACTTT 


AATAAAACGT 


TTATAGCAGT 


TACACAGAAT 


TTCAATCCTA 


GTATATAGTA 


7560 


CCTAGTATTA 


TAGGTACTAT 


AAACCCTAAT 


TTTTTTTATT 


TAAGTACATT 


TTGCTTTTTA 


7620 


AAGTTGATTT 


TTTTCTATTG 


TTTTTAGAAA 


AAATAAAATA 


ACTGGCAAAT 


ATATCATTGA 


7680 


GCCAAATCTT 


AAGTTGTGAA 


TGTTTTGTTT 


CGTTTCTTCC 


CCCTCCCAAC 


CACCACCATC 


7740 


CCTGTTTGTT 


TTCATCAATT 


GCCCCTTCAG 


AGGGCGGTCT 


TAAGAAAGGC 


AAGAGTTTTC 


7800 


CTCTGTTGAA 


ATGGGTCTGG 


GGGCCTTAAG 


GTCTTTAAGT 


TCTTGGAGGT 


TCTAAGATGC 


7860 


TTCCTGGAGA 


CTATGATAAC 


AGCCAGAGTT 


GACAGTTAGA 


AGGAATGGCA 


GAAGGCAGGT 


7920 


GAGAAGGTGA 


GAGGTAGGCA 


AAGGAGATAC 


AAGAGGTCAA 


AGGTAGCAGT 


TAAGTACACA 


7980 


AAGAGGCATA 


AGGACTGGGG 


AGTTGGGAGG 


AAGGTGAGGA 


AGAAACTCCT 


GTTACTTTAG 


8040 


TTAACCAGTG 


CCAGTCCCCT 


GCTCACTCCA 


AACCCAGGAA 


TT 




8082 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4480 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION : SEQ ID NO: 12: 

AGGGTTACAC GTCTTAACTC AGAGTTGCAA CAGGCTTGAA CAAGCCCAGG CACGCCCAGA 6 0 

TAC CTAGGGC CGAGTCACCG TTAAAACTAA CAGACCATAA AAGGAAAGGA ATACAGAACA 120 

GACTAGGAGT ACCGGATCTG ACTCACAGGC CACCTGGCAG GAAGAGATAA GCCCCAGCCC 180 

CCGACATTCA GGACGTCCCA GCCCGCACGT ACTCTTACCA TGTTACAACC TCATTCGAAT 24 0 

ATGATTCAAA CCTGCCAATG TGTGTAGCTA TACCTTATCA CCTCATCTTG TGAAATAACC 300 

AATCATATGT GAACATGTCT ATATGCTTCG TTTAAATCCA CCAATCCCCG TAACTATGCA 360 
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TCTGCTTCTG 


TACGCCCGCT 


TCTGCTTCCC 


CAAACCCTAT 


AAAAGCCCCA 


TGCTAGAGCT 


420 


GTTGGGCGCG 


CAAGTCCTCC 


GAAGAGACTG 


TGTGCCCGCA 


GGTACCTGTG 


TTTTPPAATA 

x x x x w Ln^v x n 


4 80 

*x O \J 


AACCCTCTTG 


CTGATTGCAT 


CCGAGTGGCC 


TCGGCTCGGT 


CATTGGGPGP 

wj^x x \jwwwvj\— 


TTGGGGGTPT 

X 1. ww www X L X 


3 *m U 


CCTCCTGAGG 


GAAAGGTCCT 


CTCCGGAGGT 


CTTTTCATTT 

^» X X X X wn X X X 


TGGGGGCTPG 

X W W W WW w X wVJ 


TPPGGG A TPT 

X LLOwwn 1 L X 




GGAGATCCTC 


CGCCCAGAGA 


TCACCGACCA 


CCCACCGGGA 

w uv«av w ww on 


GGTAAGPPGG 

WW X AAULLUw 


PPGGPATPTG 
L L wwLn X L i. w 


ODD 


TCGTGTCTTG 


CCCTGTCTTG 




TCCTGTGCGC 


GTGTTP AGTT 

w X O X X w/-lw X X 


L w X L X Lnw X X 


T 0 A 


T TGG APTP AG 


ATCTGGGTTT 


TGGTPGAAGG 


AGAAGGPrPA 


GGGPTTPGf2T 
www w X X wO w X 


X X L 1 LnUUb X 


*7 Q A 


TP. AG GAP PPT 


PAGPGPPTPP 


Ul x x UvUvUVJ 


GTPAGAGl\aG 


g & n ptg 2\ rr & 

urtuL 1 union 


VjL X LUkjAL 1 I 


Q a rs 
840 


CTCCCCCCGC 


AGCCPTGGAA 

w w w x vjvjrtri 


GAPGTTPPAA 


WJ X w X w 1 uu 


ZiGPPPGGTTP 
nuLLLuu 1 XL 


1 X X buuov. X L 


nnn 


AGCCCGTATC 


GGAGGGATAP 


w X ww X X X X ww 




GGGTPP HGP A 
ouo X LLnwwn 


LLl 1 LouLAl 


Q C A 

y o u 


CTCCATCTGA 


CTCTTTGTTT 


TGf3f? TTTT A C 

x www x x X X nw 


GT PG A AGPPG 


LuLUOLULU 1 


Llwi.LJ.wl In 


1 A O A 


TTTGTCTGAT 


CGTTGGATTT 


GTCTGTCTAA 

W X W X w 1W1 


TPTGTG P PPT 


21 21 TTTTPTTT 
nn xlXXLXXX 


GA AGPTAPPA 
wnnwL XnLLn 


1 uo u 


TGGGACAATC 


GCTAACAACC 


v« w w i x wn\] X %- 


TP 21 PTPT 


PP ATTGG A 2iG 
LLni X wwnno 


GAPf-TPPf-Ar' 


1 1 A A 


AC CGAGCACG 


TGATCAGTCG 


GTCGAGATCA 


AGAAAGGTPP 


TPTPPGGAGG 

X L X LLVSUnUU 


TPGGGGAPAfl 


1 *3 A A 


TCGCGCCAGC 


AAGCGGTGGG 


GCAGGAGCTC 


PTGGTTTGG P 

V— X VJ\J XXX W\JL 


AGPPPPTGTA 

nvj w L L L X W X /A 


GAAGPGATGA 
wnn w L wn X on 


1 O^A 


CAGAATACAA 


GCTTGTGGTG 


GTGGGCGCTA 


GAGG PGTGGG 

w^ivjrvj w x www 


AAAG AG TG PP 

nnnVJMU X wL L 


PTGAPPATPP 

L X wnL Ln ILL 


1 -ion 


AGCTGATCCA 


GAACCATTTT 


GTGGACGAGT 


ATGATPPPAP 


TATAGAGGAP 
X n X nunUUnL 


TPPTAPPGGA 

X L L X nL L w wn 


1 7 PA 


AACAGGTAGT 


CATTGATOfiG 


Gl\Cl IV CGTGTT 
UnVJMV. w lull 


T H PTGG z\ r* 2i T 


L x 1 AvjALnLn 


bLAby X LAfty 


t ji a n 


AAGAGTATAG 


TG CC ATG CGG 




TGPGPZiPZxGG 

X OLVJLftL>lO\3 


Uyhuuul XXL 


L 1 L X w 1 \J 1 n 1 


lbUU 


TTGCCATCAA 


CAACACCAAG 


i v_ w x J. l unrtu 


>iLn X L w A x wM 


w X nLAuuunu 


PAf_ ATPa APP 
LAvjA X LAAul 


1 be U 


GGGTGAAAGA 


TTCAGATGAT 


GTG C C A ATHf? 

\7 x OV.Lnn X ww 


TGPTGG TGGG 
x ww x wur X wu 


PH 21 P a nrjTr^T 


p a pptppppp 

Onl L 1 VJOLLI? 


T C "5 A 


CTCACACTGT 


TGAGTCTCGG 




APPTTGPTPG 
aww X 1 bt X ww 


P A GPT a Trie n 

LAVJL X nX UuL 


RTOPPPTRPA 
n X L LLL X ALA 


i < o n 


TTGAAACATC 


AGCCAAGACC 

w^mun\^ Vw» 


CGAPPAGGTR 


TGGAGGATGP 

X VJUftOuft X ww 


PTTPTAPAPA 
L X X L X nLnLn 


PTHPTBPPTP 
L X nw X ALU X kj 


T *7 A A 


AGATTCGGCA 


GCATAAACTG 


CGGAAAPTGA 

^-ww*W*V» X wrt 


APPPGPPTGA 

www ww X w/\ 


TGAGAGTGGP 
X ununo X uoL 


PPTPPPTPP A 
LLlbuL 1 uLA 


IbUU 


TGAGCTGCAA 


GTGTGTGCTG 


TC CTGACAC C 


AGGTTAAGGA 


PPTG ITTTTr 

LLXwnX X X XL 


PGPPAGA BP.P 
L wLLnVjnnwL 


i o/rn 
iobU 


CGTACGGACA 


CCCTGACCAG 


GTGGCCTACA 


TTGTCACCTG 


GGAGAGCTTG 


GCATTTAGCC 


1920 


CTCCTCCTTG 


GGCAGAACCC 


TTTGTGGACC 


CGAATTGGCT 


TCCTGTTTCC 


CCTAAACCTG 


1980 


TTTCCCCGAG 


CCCACCTGAC 


CCTTTGGTTG 


CTTCTTCCTC 


TCTCTATCCT 


GCTCTAACTA 


2040 


AGGAAGAATC 


TCCCAAAGTC 


CCTCCCCCGA 


AACCTGTCCT 


CCCAGAGGAC 


CCAAATTCCC 


2100 


CCCTTATAGA 


TCTCCTGTTG 


GAAGAACCTC 


CTCCGTACCC 


TGTACCTACA 


GCCCCGCCAA 


2160 


GAGAAGAGGA 


AGTGGAGCCG 


CCTGCTAGAC 


CTCGACTCGA 


GGCGGCCCCT 


TCCCCTGTGG 


2220 


CTGGAAGACT 


TCGGGGACGA 


CGCGAGGTGG 


CGCCAGACTC 


CACCTCCCAG 


GCCTTTCCGC 


2280 
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TTAGACAAGG 


GGCTGGCGGC 


CAGATACAAT 


ACTGGCCATT 


CTCAGCGGCC 


GACATATATA 


2340 


ACTGGAAACA 


ACACAACCCC 


CCCTTTTCTA 


AGGATCCGGT 


GGCTCTCACC 


AACCAGATAG 


2400 


AATCTGTCTT 


GCTTACCCAT 


CAGCCCACTT 


GGGATGATAT 


ACAGCAACTT 


TTACAGGCCC 


2460 


TCCTGACCTC 


TGAAGAGAAG 


CAGAGAGTGC 


TCTTAGAGGC 


CAGGAAACAT 


GTTTTGGGGG 


2520 


ACAATGGACG 


CCCCACCTTG 


CTCCCGAAAG 


AGATCGATGA 


TGCATTCCCA 


CTTACAAGAC 


2580 


CTGATTGGGA 


TTTCACCACG 


GCTAAAGGTA 


GGAGACACCT 


ACGCCTTTAT 


CGCCAGTTGC 


2640 


TCCTAGCGGG 


TCTCCGAGGG 


GCGGCACGAC 


GCCCCACCAA 


TTTGGCTCAG 


GTAAAACAAG 


2700 


TGGTACAAGA 


GGCTGCGGAG 


ACTCCCTCAG 


CCTTCCTAGA 


GAGACTTAAG 


GAAGCTTATC 


2760 


GCATGTATAC 


CCCTTATGAT 


CCAGATGATC 


CAGGACAAAT 


GACAAATGTC 


TCCATGTCCT 


2820 


TCATCTGGCA 


GGCAGCACCA 


GATATCAGGG 


CCAAGCTACA 


GAGAATAGAA 


AATTTACAAG 


2880 


GGTATACACT 


GCAGGATTTA 


CTTAAGGAGG 


CAGAAAGAAT 


TTATAACAAG 


AGAGAGACAC 


2940 


AAGAAGAAAA 


GAAAGATAAA 


ATACGTAGAG 


AAAAAGATGA 


GAGAGACCGA 


AAAAGAAACA 


3000 


GAGAGTTGAG 


TCGAATCTTG 


GCCGCCGTAG 


TTCAGGGTCA 


AGAGAAAAGG 


GGAGAGAGGG 


3060 


TGGGAGTTCG 


AAAGGGGCCA 


AAGCTAGATA 


AGGATCAATG 


TGCGTATTGC 


AAAGAAAGAG 


3120 


GACACTGGGC 


CAGAGATTGC 


CCTAAGAAAC 


CCAGCGGCTC 


CGAAGACCCC 


GCCCACAGAC 


3180 


CTCCCTCTTG 


GCCCTAGATA 


AAGATTAGGG 


AGGTCAGGGC 


CAGGAGCCCC 


CCCCTGAGCC 


3240 


CAGGATAACT 


CTTGAAGTTG 


GGGGGCAGCC 


AGTCACCTTT 


CTGGTGGACA 


CAGGAGCCCA 


3300 


GCACTCAGTC 


CTCACCCAGG 


CCCCTGGACA 


ACTCAGCGAC 


CGGACGGCCT 


GGGTACAAGG 


3360 


AGCCACTGGC 


AGCAAGAGAT 


ACCGTTGGAC 


TACAGATCGA 


CGGGTTCAGC 


TGGCTACTGG 


3420 


TAAGGTGACC 


CATTCCTTCT 


TACATGTTCC 


GGACTGCCCA 


TACCCTCTGC 


TGGGCCGTGA 


3480 


CTTGCTTACC 


AAATTAAAAG 


CTCAGATCCA 


TTTTGAAGAA 


GGAGGGACCC 


GAGTAACCGG 


3540 


GCCCCGCGGT 


ATTCCTCTTC 


AGATTTTAAC 


CCTTCAGTTA 


GAAGATGAAT 


ATAGATTATA 


3600 


TGAACCAGAA 


CAGGACAAGC 


CAAAATCTCC 


AGAAATAGAC 


TCTTGGGTCA 


CGAAATTCCC 


3660 


ACTGGCCTGG 


GCAGAGACTG 


GCGGGATGGG 


GTTGGCGCTC 


CAACAGCCTC 


CCCTAATTAT 


3720 


CCAGTTAAAG 


GCCACCGCGA 


CTCCTGTCTC 


CATTAAACAG 


TACCCCATGT 


CATGGGAAGC 


3780 


TTATCAGGGC 


ATAAAGCCAC 


ATATCAGGAG 


GCTCTTAGAC 


CAAGGCATCC 


TAGTCCCTTG 


3840 


CCGGTCACCC 


TGGAATACGC 


CTCTGCTACC 


TGTTAAGAAG 


CCCGGCACTG 


GAGACTATAG 


3900 


GCCAGTACAA 


GATTTGAGAG 


AGGTCAACAA 


AAGAGTAGAA 


GATATTCATC 


CAACTGTCCC 


3960 


AAACCCTTAT 


AACCTACTCA 


GCACCCTGCC 


TCCCACCCAT 


ACTTGGTATA 


CGGTCTTAGA 


4020 


TCTGAAGGAT 


GCTTTCTTCT 


GCCTCCGGCT 


GAGCCCAGAA 


AGCCAGCCCT 


TATTTGCTTT 


4080 


TGAGTGGAAA 


GACTCTGAAA 


TGGGGCTTTC 


GGGACAGTTG 


ACTTGGACAA 


GGTTACCACA 


4140 


GGGTTTCAAA 


AACAGCCCAA 


CGCTCTTTGA 


TGAGGCCTTA 


CACCGGGACT 


TGGCTGACTT 


4200 
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TCGAGTCCAG CATCCCACTC TTATACTTCT TCAGTTTGTT GATGACCTTC TTCTAGGGGC 4260 

CACTTCTGAG ACAGCATGCC ACCAGGGAAC AGAATCCCTC TTGCAGACTT TGGGGCGATT 432 0 

GGGCTATCGA GCTTCTGCCA GAAAGGCTCA AATTTGCCAG ACCCAGGTTA CTTATTTAGG 43 8 0 

CTATCAACTA AGGGATGGAC AGCGATGGCT GACTCCGGCT AGGAAACAGA CCGTGGCCAA 4440 

CATCCCAGCC CCAAGAAATG GCCGACAGCT ACGGGAATTC 4480 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 565 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

GCTGAGTAGT GCGCGAGCAA AATTTAAGCT ACAACAAGGC AAGGCTTGGC CGACAATTGC 6 0 

ATGAAGAATC TGCTTAGGGT TAGGCGTTTT GCGCTGCTTC GCGATGTACG GGCCAGATAT 12 0 

ACGCGTATCT GAGGGGACTA GGGTGTGTTT AGGCGAAAAG CGGGGCTTCG GTTGTACGCG 180 

GTTAGGAGTC CCCTCAGGAT ATAGTAGTTT CGCTTTTGCA TAGGGAAGGG GAAATGTAGT 24 0 

CTTATGCAAT ACTCTTGTAG TCTTGCAACA TGCTTATGTA ACGATGAGTT AGCAACATGC 300 

CTTACAAGGA GAGAAAAAGC ACCGTGCATG CCGATTGGTG GAAGTAAGGT GGTACGATCG 36 0 

TGCCTTATTA GGAAGGCAAC AGACGGGTCT GACATGGATT GGACGAACCA CCGAATTCCG 420 

CATTGCAGAG ATATTGTATT TAAGTGCCTA GCTCGATACA ATAAACGCCA TTTGACCATT 480 

CACCACATTG GTGTGCACCT GGGTTGATGG CCGGACCGTT GATTCCCTGA CGACTACGAG 54 0 

CACCTGCATG AAGCAGAAGG CTTCA 56 5 

* 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1804 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 

GGATCCTCAG GGGTAACACC TTTTGGAGGT GGGCATCTTC CTCATTCTCA GTGGTGCCAA 60 

GTTCATATCC TGCTGGCTTA ACACGTGGTG TTACTATATT TGTGGCCTTA TATGATTATG 120 

AAGCTAGAAC TACAGAAGAC CTTTCATTTA AGAAGGGTGA AAAATTTCAA ATAATTAACA 180 

ATACAGAAGG AGACTGGTGG GAAGCAAGAT CAATCACTAC AGGAAAGAAT GGTTATATCC 24 0 

TGAGCAGTTA TGTAGCGCCT GCAGATTCCA TTCAGGCAGA AGAATGGTAT TTTGGCAAAA 300 

TGGGGAGAAA AGATGCTGAA AGATTACTTC TGAATCCTGG AAATTAATGA GGTATTTTCT 36 0 

TAGGAAGAGA GAGTGAAATG GCTGGGTGCA GTGGCTCATG CCTGTAATCC CAGCACTTTG 420 
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GGAGGCCGAG 


TTGGGCGGAT 


CACCTGAGGT 


CAGGAGTTCG 


AGACTAGCCT 


GGCCAACATG 


480 


GTGAAACCCC 


ATCTCTACTA 


AAAAAAAAAG 


TACAAAATTA 


GCTGGACGTG 


GTGGTGAGTG 


540 


CCTGTAATCC 


CAGCTACTCA 


GGAGGCTGAG 


GCAGCAGAAT 


CACTTGAACC 


TGGGAGGCGG 


600 


AGGTTGCAGT 


GAGCTGAGAT 


CGCGCCACTG 


CACTCCAGCC 


TCGGCGACAA 


GAGCAAAAAC 


660 


TCCGTCTAAA 


AAACAAATAA 


GCAAACAGAA 


CAAAACAAAA 


CAAAAACGAG 


AGAGCGAAAC 


720 


TACTAAAGGT 


GCTTATTCCC 


TCTCTATTCG 


TGATTGGGAT 


GAGGTAAGGG 


GTGACAATGT 


780 


GAAACACCAC 


AAAATTAGGA 


AACTTGACAA 


TGGTAGATAC 


TATATCACAA 


CCAGAGAACA 


840 


ACTTGATACT 


CTGCAGAAAT 


TGGCAAAACA 


CTACACAGAA 


CATG CTGATG 


GTTTATGCCA 


900 


CAAGTTAACA 


ACTGTGTGTC 


CAACTGTGAA 


ACCTCAGATT 


CAAGGTCTAG 


CAAAAGATGC 


960 


TTGGGAAATC 


CCTTGATAAT 


CTTTGCGACT 


AGAGGTTAAA 


CTAGGACAAG 


GATGTTTTGG 


1020 


CAAAGTGTGG 


ATGGGAATAT 


GGAATGGAAC 


CACAAAAGTA 


GCAATCAAAA 


CACTAAAACC 


1080 


AGGTACAATG 


ATGCCAGAAG 


CTTTTCTTCA 


AGAAGCTCAG 


GTAATGAAAA 


AAATAAGACA 


1140 


TGGTAAACTT 


GTTCCACTAT 


ATGCTGTTGT 


TTCTGAAGAG 


CCAATTTACA 


TTGTCACTGA 


1200 


ATTGATGTCA 


AAAGGAAGCT 


TATTCAATTT 


CCTTAAGGAA 


GGAGATGGAA 


AGTATTTGAA 


1260 


GCTTCCACAA 


ATGGTTGATA 


TGCCTGCTCA 


GATTGCTGAT 


GGTATGGCAT 


ATATTAAAAG 


1320 


AATGAACTAT 


ATTCACCGAG 


ATCTCTGGGC 


TGCTAATATT 


CTTGTAGGAG 


AAAATCTTCT 


1380 


GTGCAAAATA 


GCAGATTTTG 


GTTTAGCAAG 


GTTAATTGAA 


GACAATGAAT 


ACACATCAAG 


1440 


ACAAGGTGCA 


GAATTTCCAA 


TCAAATGGAC 


AGCTCCTGAA 


GTTGCACTGT 


ATGGTGGGTT 


1500 


TACAATAAAG 


TCTGGTGTCT 


GCTCATTTGG 


AATTCTACAG 


ACAGAACTGG 


TAACAAAGGG 


1560 


CAGAGTGCCA 


TATCCAGGTA 


TGGTGAACCA 


TGAAATACTG 


GAACAGGTGG 


AGCGAGGATA 


1620 


CAGGATGCCT 


TGCCCTCAGG 


GCTGTCCAGA 


ATCCCTCCAT 


GAATTGATGA 


ATCTGTGTTG 


1680 


GAAGAAGGAC 


C CTGATG AAA 


GACCAACATT 


TGAATATGTT 


CAGTCCTTCT 


TGGGAGACTA 


1740 


CTTCACTGCT 


ACAGAGCCAT 


AGTACCAGCC 


AGGAGAAAAC 


TTCTAATTCA 


AGTAGCCTAT 


1800 


TTTA 
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Claims 

1 . A cellular immunogen for immunizing a host against the 
effects of the product of a target proto-oncogene, the overexpression of which 
target proto-oncogene is associated with a cancer, which cellular immunogen 
comprises host cells which have been transfected with at least one transgene 
construct comprising at least one transgene cognate to the target proto-oncogene 
and a strong promoter to drive the expression of the transgene in the transfected 
cells, the transgene encoding a gene product which induces host 
immunoreactivity to host self-determinants of the product of the target proto- 
oncogene gene. 

2. An immunogen according to claim 1 wherein the transgene 

comprises 

wild-type or mutant retroviral oncogene DNA; or 
wild-type or mutant proto-oncogene DNA of a species 
different from the host species. 

3. An immunogen according to claim 2 wherein the transfected 
cells are non-dividing. 

4. An immunogen according to claim 2 wherein the transgene 
comprises mutant retroviral oncogene DNA or mutant proto-oncogene DNA. 

5. An immunogen according to claim 4 wherein the mutant DNA 
is nontransforming. 

6. An immunogen according to claim 5 wherein the mutant DNA 
comprises a deletion mutation in a region of said DNA which is essential for 
transformation. 
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7. A cellular immunogen according to claim 6 wherein the host 
cells have been transfected with a plurality of transgene constructs, each 
construct encoding a different deletion mutation. 



8. An immunogen according to claim 1 wherein the host cells 
have been transfected with a transgene cognate to a target proto-oncogene 
selected from the group of proto-oncogenes consisting of AKT-2 , c-erbfl-2, 
MDM-2, c-myc, c-myb, c-ras, c-src and c-yes. 

9. An immunogen according to claim 1 wherein the cells 
comprise fibroblasts. 



10. A method for preparing a cellular immunogen for 
immunizing a host against the effects of the product of a target proto-oncogene, 
the overexpression of which target proto-oncogene is associated with a cancer, 
the method comprising: 

(a) excising cells from the host; 

(b) transfecting the excised cells with at 
least one transgene construct comprising at least 
one transgene cognate to the target proto-oncogene 
and a strong promoter to drive the expression of 
the transgene in the transfected cells, the 
transgene encoding a gene product which induces 
host immunoreactivity to host self-determinants of 
the product of the target proto-oncogene gene . 



11. A method according to claim 11 wherein the transgene 

comprises 

wild-type or mutant retroviral oncogene DNA; or 
wild-type or mutant proto-oncogene DNA of a species 
different from the host species. 
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12. A method according to claim 1 1 wherein the iransfected cells 
are non-dividing. 

13. A method according to claim 11 wherein the transgene 
comprises mutant retroviral oncogene DNA or mutant proto-oncogene DNA. 

14. A method according to claim 13 wherein the mutant DNA 
is nontransforming. 

15. A method according to claim 14 wherein the mutant DNA 
comprises a deletion mutation in a region of said DNA which is essential for 
transformation. 

16. A method according to claim 15 wherein the host cells are 
transfected with a plurality of transgene constructs, each construct encoding a 
different deletion mutation. 

17. A method according to claim 11 wherein the transgene is 
cognate to a target proto-oncogene selected from the group of proto-oncogenes 
consisting of AKT-2, c-erW?-2, MDM-2, c-myc, c-myb, c-ras, c-src and c-yes. 

18. A method according to claim 1 wherein the excised cells 
comprise fibroblasts. 

19. A method of vaccinating a host against disease associated 
with the overexpression of a target proto-oncogene comprising 

(a) excising cells from the host; 

(b) transfecting the excised cells with at 
least one transgene construct comprising at least 
one transgene cognate to the target proto-oncogene 
and a strong promoter to drive the expression of 
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the transgene in the transfected cells, the 
transgene encoding a gene product which induces 
host immunoreactivity to host self-determinants of 
the product of the target proto-oncogene gene; 

(c) returning the excised cells transfected 
with the transgene construct to the body of the 
host to obtain expression of the transgene in the 
host. 



20. A method according to claim 19 wherein the transgene 

comprises 

wild-type or mutant retroviral oncogene DNA; or 
wild-type or mutant proto-oncogene DNA of a species 
different from the host species. 



21 . A method according to claim 20 wherein the transfected cells 
are rendered non-dividing prior to return to the body of the host. 

22. A method according to claim 20 wherein the transgene 
comprises mutant retroviral oncogene DNA or mutant proto-oncogene DNA. 

23. A method according to claim 22 wherein the mutant DNA 
is nontransforming. 

24. A method according to claim 23 wherein the mutant DNA 
comprises a deletion mutation in a region of said DNA which is essential for 
transformation. 



25. A method according to claim 24 wherein the host cells are 
transfected with a plurality of transgene constructs, each construct encoding a 
different deletion mutation. 
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26. A method according to claim 19 wherein the transgene is 
cognate to a target proto-oncogene selected from the group of proto-oncogenes 
consisting of AKT-2, z-erbB-2, MDM-2, c-myc, z-myb* c-ras t c-src and c-yes. 



27. A method according to claim 19 wherein the excised host 
cells comprise fibroblasts. 



28. A method of vaccinating a host against disease associated 
with the overexpression of a targeted proto-oncogene comprising 

(a) excising cells from the host; 

(b) transfecting the excised cells with at 
least one transgene construct comprising at least 
transgene and a strong promoter to drive the 
expression of the transgene in the transfected 
cells, wherein the transgene comprises 

(1) wild-type or mutant cognate retroviral 
oncogene DNA; or 

(2) wild-type or mutant cognate proto- 
oncogene DNA of a species different from the 
host species; 

(c) returning the excised cells transfected 
with the transgene construct to the body of the 
host to obtain expression of the transgene in the 
host. 
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A cellular immunogen is provided for immunizing a host against the effects of the product of a target proto-oncogene, where the 
overexpression of the target proto-oncogene is associated with a malignancy. The cellular immunogen comprises host cells which have 
been transfected with at least one transgene construct comprising a transgene cognate to the target proto-oncogene and a strong promoter to 
*! SJf Sl0n of ^ towigene ^ the transfected cells. The transgene encodes a gene product which induces host immunoreactivity 
to host self-determinants i of the product of the target proto-oncogene gene. The transgene may comprise, for example, wild-type or mutant 
^h^Tl^ C08Cr1 ^ DN ^ l c 1 °8 natc to proto-oncogene; or wild-type or mutant proto-oncogene DNA of a species different from 

tMnSS^^'t^ ■mmunogen may be prepared from biopsied host cells, e.g. skin fibroblasts, which are stably or transiently 

Ir ^n lT^ T*£?' 7TT C ° nU i ning the COgnatC tranSgenc ' The host cells transfected with the cognate transgene construct 
are then returned to the body of the host to obtain expression of the cognate transgene in the host. 
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"CELLULAR IMMUNOGENS USEFUL AS CANCER VACCINES" 

Cross-Referenc e to Related Application 
Priority from U.S. provisional patent application No. 60/010,262, 
filed January 19, 1996 is claimed. 

Field of the Invention 

The invention relates to the field of cancer vaccination and 
immunotherapy . 



20 



Background of the Invention 

A current goal of cancer research is the identification of host 
10 factors that either predispose to tumor formation or serve to enhance tumor 
growth. 

Genes that confer the ability to convert cells to a tumorigenic 
state are known as oncogenes. The transforming ability of a number of 
retroviruses has been localized in individual viral oncogenes (generally \-onc). 
Cellular oncogenes (generally c-onc) present in many species are related to viral 
oncogenes. It is generally believed that retroviral oncogenes may represent 
escaped and/or partially metamorphosed cellular genes that are incorporated into 
the genomes of transmissible, infectious agents, the retroviruses. 

Some c-onc genes intrinsically lack oncogenic properties, but may 
be converted by mutation into oncogenes whose transforming activity reflects 
the acquisition of new properties, or loss of old properties. Amino acid 
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substitution can convert a cellular proto-oncogene into an oncogene. For 
example, each of the members of the z-ras proto-oncogene family (H-ras, N-ras 
and K-ras) can give rise to a transforming oncogene by a single base mutation. 

Other c-onc genes may be functionally indistinguishable from the 
5 corresponding v-onc, but are oncogenic because they are expressed in much 
greater amounts or in inappropriate cell types. These oncogenes are activated 
by events that change their expression, but which leave their coding sequence 
unaltered. The best characterized example of this type of proto-oncogene is c- 
myc. Changes in MYC protein sequence do not appear to be essential for 
10 oncogenicity. Overexpression or altered regulation is responsible for the 
oncogenic phenotype. Activation of c-myc appears to stem from insertion of a 
retroviral genome within or near the c-myc gene, or translocation to a new 
environment. A common feature in the translocated loci is an increase in the 

level of c-myc expression. 

15 Gene amplification provides another mechanism by which 

oncogene expression may be increased. Many tumor cell lines have visible 
regions of chromosomal amplification. For example, a 20-fold c-myc 
amplification has been observed in certain human leukemia and lung carcinoma 
lines. The related oncogene N-myc is five to one thousand fold amplified in 

20 human neuroblastoma and retinoblastoma. In human acute myeloid leukemia 
and colon carcinoma lines, the proto-oncogene c-myb is amplified five to ten 
fold. While established cell lines are prone to amplify genes, the presence of 
known oncogenes in the amplified regions, and the consistent amplification of 
particular oncogenes in many independent tumors of the same type, strengthens 

25 the correlation between increased expression and tumor growth. 

Immunity has been successfully induced against tumor formation 
by inoculation with DNA constructs containing v-onc genes, or by inoculation 
with v-onc proteins or peptides. A series of reports describe a form of 
"homologous" challenge in which an animal test subject is inoculated with either 

30 v-src oncoprotein or DNA constructs containing the v-src gene. Protective 
immunity was induced against tumor formation by subsequent challenge with v- 



WO 97/25860 



PCT/US97/005M 



- 3 - 



src DNA or v-src- induced tumor cells. See, Kuzumaki et al, JNCI (1988), 
80:959-962; Wisner et al, J. Virol. (1991), 65:7020-7024; Halpern et al. 
Virology (1993), 197:480-484: Taylor et al, Virology (1994), 205:569-573; 
Plachy et al, Immunogenetics (1994), 40:257-265. A challenge is said to be 
5 "homologous" where reactivity to the product of a targeted gene is induced by 
immunization with the same gene, the corresponding gene product thereof, or 
fragment of the gene product. A challenge is "heterologous" where reactivity 
to the product of a targeted gene is induced by immunization with a different 
gene, gene product or fragment thereof. 

10 WO 92/ 14756 (1992) describes synthetic peptides and oncoprotein 

fragments which are capable of eliciting T cellular immunity, for use in cancer 
vaccines. The peptides and fragments have a point mutation or translocation as 
compared to the corresponding fragment of the proto-oncogene. The aim is to 
induce immunoreactivity against the mutated proto-oncogene, not the wild-type 
15 proto-oncogene. WO 92/14756 thus relates to a form of homologous challenge. 

EP 119,702 (1984) describes synthetic peptides having an amino 
acid sequence corresponding to a determinant of an oncoprotein encoded by an 
oncogenic virus, which determinant is vicinal to an active site of the 
oncoprotein. The active site is a region of the oncoprotein required for 
20 oncoprotein function, e.g., catalysis of phosphorylation. The peptides may be 
used to immunize hosts to elicit antibodies to the oncoprotein active site. EP 
119,702 is thus directed to a form of homologous challenge. 

The protein product encoded by a proto-oncogene constitutes a 
self antigen and, depending on the pattern of its endogenous expression, would 
25 be tolerogenic at the level of T cell recognition of the self peptides of this 
product. Thus, vaccination against cancers which derive from proto-oncogene 
overexpression is problematic. 

Recent attempts have been made to induce immunity in vitro or 
in vivo to the product of the HER-2//n?k proto-oncogene. The proto-oncogene 
30 encodes a 185-kDa transmembrane protein. The HER-2/neu proto-oncogene is 
overexpressed in certain cancers, most notably breast cancer. In each report 
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discussed below, the immunogen selected to induce immunity comprised a 
purified peptide of the pl85 HER ' 2Mw protein, and not a cellular immunogen. 

Disis et aL, Cancer Res. (1994) 54:16-20 identified several 
breast cancer patients with antibody immunity and CD4+ helper/inducer T-cell 
5 immunity responses to pl85 HER2/n<rw protein. Antibodies to plS5 HEK ' 2fneu were 
identified in eleven of twenty premenopausal breast cancer patients. It was 
assumed prior to this work that patients would be immunologically tolerant to 
HER-2/ neu as a self-protein and that immunity would be difficult to generate. 

Disis et aL, Cancer Res. (1994) 54:1071-1076 constructed 

10 synthetic peptides identical to pl85 HER2/n ™ protein segments with amino acid 
motifs similar to the published motif for HLA-A2.1 -binding peptides. Out of 
four peptides synthesized, two were shown to elicit peptide-specific cytotoxic 
T-lymphocytes by primary in vitro immunization in a culture system using 
peripheral blood lymphocytes from a normal individual homozygous for HLA- 

15 A2. Thus, it was concluded that the plS5 HVR ' 2lneu proto-oncogene protein 
contains immunogenic epitopes capable of generating human CD8 4 cytotoxic T- 
lymphocytes. 

The cytotoxic T cells elicited in the latter report were not, 
however, shown to recognize tumor cells, but only targets that bound the 

20 synthesized peptides. Other work (Dahl et aL, J. Immunol. (1996), 157:239- 
246) has demonstrated that cytotoxic cells may recognize targets that bind 
peptide but fail to recognize targets that endogenously synthesize peptide. It is 
thus unclear whether the cytotoxic cells elicited by Disis et aL would be capable 
of recognizing tumor cells. In any event, no protection against tumor growth 

25 was demonstrated by Disis et aL 

Peoples etaL, Proc. NatL Acad. Sci. USA (1995), 92:432-436, 
report the identification of antigenic peptides presented on the surface of ovarian 
and breast cancer cells by HLA class I molecules and recognized by tumor- 
specific cytotoxic T lymphocytes. Both HLA-A2-restricted breast and ovarian 

30 tumor-specific cytotoxic T lymphocytes recognized shared antigenic peptides. 
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T cells sensitized against a nine-araino acid sequence of one of the peptides 
demonstrated significant recognition of HLA-A2 HER2/neu tumors. 

It remains unclear whether Peoples et al. have successfully 
attacked proto-oncogene-encoded self, as the immunizing peptide which is 
5 expressed in the tumor cells contained an isoleucine at position 2, whereas the 
peptide expressed in norma! tissue contains valine residue at this position. 
Moreover, although stimulation of T cells occurred in vitro, this stimulation 
does not represent a true primary immune response insofar as the starting T cell 
population represented tumor infiltrating lymphocytes. 
10 The research accounts of Disis et al. and Peoples et al. required 

a form of in vitro stimulation, either priming as described by Disis et al. , or 
restimulation as described by Peoples et al. The in vitro protocols of Disis et 
al. and Peoples et al. require a mutant cell line to aid in selection of the peptide 
which will serve to induce reactivity. Non-mutant, peptide antigen-presenting 
15 cells have their HLA class I molecules already loaded with endogenous 
peptides, a phenomenon which precludes exogenous loading from without. The 
value of the mutant lines is that they lack the TAP genes (encoding the 
transporters associated with antigen presentation). Class I binding of internally- 
derived peptides is significantly lowered, and "empty" class I molecules are 
present on the cell surface and available for binding of exogenously added 
peptides. This availability of peptide binding sites on membrane-bound class 
I allows examination of whether a given peptide will (i) even bind to class 1, 
and (ii) function as a target in cytotoxic T cell assays. However, the need for 
a mutant cell line for deduction of candidate immunizing peptide sequences 
25 limits the usefulness of peptide-based immunization schemes. 

Fendly et al., J. Biol. Response Modifiers (1990), 9:449-455 
present an account of a polypeptide-based immunotherapy. Purified polypeptide 
corresponding to the extracellular domain of the pl85 HER2 ' w « protein was 
obtained from a transfected cell line. The purified peptide was employed in the 
immunization of guinea pigs. The immunized animals developed a cellular 
immune response, as monitored by delayed-type hypersensitivity. Antisera 
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derived from immunized animals specifically inhibited the in vitro growth of 
human breast tumor cells overexpressing pl&5 HEKVnelt . There is no indication 
by Fendly et al of induction of self versus non-self reactivity. It is likely that 
the guinea pigs were chiefly responding to non-self determinants (as defined in 

5 terms of the guinea pig host) on the human polypeptide immunogen. 

The use of peptides for immunization is of necessity limited to 
immunization with a single haplotype. There are approximately thirty HLA 
types in man. In each case of peptide immunization, one must be careful to 
select peptides which match the host HLA type. The selected peptide must be 

10 immunogenic in the host and be capable of presentation to host immune system 
cells. 

What is needed is an immunization method for immunizing 
humans and animals against self-encoded proto-oncogenes which are associated 
with the development of cancer, which dispenses with the need for isolating 
15 immunogenic, HLA host-matched peptides for immunization. 



Summary of the Invention 

It is an object of the invention to induce reactivity to self- 
determinants of the product of an overexpressed proto-oncogene. 

It is an object of the invention to provide for a form of therapy 
20 or prophylaxis based upon the capacity to induce immune reactivity to proto- 
oncogene-encoded self as overexpressed in tumor cells. 

It is an object of the invention to provide a cellular immunogen 
for use in immunization against self proto-oncogene determinants. 

It is an object of the invention to provide for a method for 
25 vaccinating a host against disease associated with the overexpression of a proto- 
oncogene. 

These and other objects will be apparent from the following 

disclosure. 

A method of vaccinating a host against disease associated with the 
30 overexpression of a target proto-oncogene is provided. The method comprises: 
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(a) excising cells from the host; 

(b) transfecting the excised cells with at 
least one transgene construct comprising at least 
one transgene cognate to the target proto-oncogene 

5 and a strong promoter to drive the expression of 

the transgene in the transfected cells, the 
transgene encoding a gene product which induces 
host immunoreactivity to host self-determinants of 
the product of the target proto-oncogene gene; 
10 (c) returning the excised cells transfected 

with the transgene construct to the body of the 
host to obtain expression of the transgene in the 
host. 

According to one principal embodiment of the invention, the 
15 transgene comprises wild-type or mutant retroviral oncogene DNA. According 
to another principal embodiment of the invention, the transgene comprises wild- 
type or mutant proto-oncogene DNA of a species different from the host 
species. Where the transgene comprises mutant retroviral oncogene DNA or 
mutant proto-oncogene DNA, the mutant DNA is preferably nontransforming. 
20 The mutant DNA preferably comprises a deletion mutation in a region of the 
DNA which is essential for transformation. Preferably, the host cells are 
transfected with a plurality, most preferably at least five, different transgene 
constructs, each construct encoding a different deletion mutation. 

In one preferred embodiment of the invention, the mutant DNA 
has at least about 75% homology, more preferably at least about 80% 
homology, most preferably at least about 90% homology, with the 
corresponding wild-type oncogene or proto-oncogene DNA. 

The invention is farther directed to a cellular immunogen for 
immunizing a host against the effects of the product of a target proto-oncogene, 
the overexpression of which is associated with a cancer. The cellular 
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immunogen comprises the host cells which have been transfected with at least 
one transgene construct, as described above. 

The invention is also directed to a method of preparing the 
cellular immunogen, by (a) excising cells from the host, and (b) transfecting the 
5 excised cells with at least one transgene construct, as described above. 

The cells transfected with the transgene are preferably rendered 
non-dividing prior to return to the body of the host. 

The term "corresponds to" is used herein to mean that a 
polynucleotide sequence is homologous (i.e., is identical, not strictly 
10 evolutionarily related) to all or a portion of a reference polynucleotide sequence, 
or that a polypeptide sequence is identical to a reference polypeptide sequence. 

The term "cognate" as used herein refers to a gene sequence that 
is evolutionarily and functionally related between species. For example but not 
limitation, in the human genome, the human c-myc gene is the cognate gene to 
15 the mouse c-myc gene, since the sequences and structures of these two genes 
indicate that they are highly homologous and both genes encode proteins which 

are functionally equivalent. 

By "homology" is meant the degree of sequence similarity 
between two different amino acid sequences, as that degree of sequence 
20 similarity is derived by the FASTA program of Pearson and Lipman, Proc. 
Natl. Acad. Sci. USA (1988), 85:2444-2448, the entire disclosure of which is 

incorporated herein by reference. 

As used herein, the term "operably linked" refers to a linkage of 
polynucleotide elements in a functional relationship. A nucleic acid is "operably 

25 linked" when it is placed into a functional relationship with another nucleic acid 
sequence. For instance, a promoter or enhancer is operably linked to a coding 
sequence if it affects the transcription of the coding sequence. Operably linked 
means that the DNA sequences being linked are typically contiguous and, where 
necessary to join two protein coding regions, contiguous and in reading frame. 

30 The word "transfection" is meant to have its ordinary meaning, 

that is, the introduction of foreign DNA into eukaryotic cells. 
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By "transgene" is meant a foreign gene that is introduced into one 
or more host cells. 

By "transgene construct" is meant DNA containing a transgene 
and additional regulatory DNA, such as promoter elements, necessary for the 
expression of the transgene in the host cells. 
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Description nf the Figures 
Figs. 1A and IB are plots of the mean tumor diameter over time 
following subcutaneous wing web inoculation of 1 -day-old line TK (Fig. 1A) 
and line SC (Fig. IB) chickens with 100 M g of tumorigenic plasmids pwrc527 
(_*_), pVSRC-Cl (-•-) or pMwc (--■-). The mean tumor diameter 
(mm) at a particular time point and for any one group of TK or SC line 
chickens inoculated was computed as the sum of the diameters of the primary 
tumors divided by the number of chickens surviving to that point. The ratios 
at each time point show, for a particular group, the number of chickens bearing 
palpable tumors to the total number of survivors to that point (standard typeface 
for pwrc527, italics for pVSRC-Cl, bold typeface for pMWsrc). Error bars 
(unless obscured by the symbol) indicate standard error. 

Figs. 2A and 2B are plots of the growth of challenge (wing web) 
tumors in test and control line TK chickens under conditions of (i) priming and 
homologous challenge with plasmid pcjrc527 (Fig.2A: —a—, test; — a — 
control), or (ii) priming and homologous challenge with plasmid pVSRC-Cl 
(Fig. 2B: -O-, test; control). Test chickens were primed at 1 day 

posthatch with 100 „g of construct; test and control chickens were challenged 
at five weeks posthatch with 200 n of construct. The mean challenge diameter 
was computed as in Figs. 1 A and IB. At each time point the ratio of chickens 
bearing palpable challenge tumors to total number of survivors to that point is 
indicated (standard typeface for control group, bold typeface for test group). 
The statistical comparison between the mean challenge tumor diameters of the 
test versus the control group at a particular time point was made using a two- 
tailed student's t test, *(p<0.05), **(p<0.01), ***(p<0.001). The statistical 
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comparison between the ratios of chickens bearing palpable challenge tumors to 
total number of survivors of the test versus the control group at a particular 
time point was made using a chi-squared test; the paired ratios are underlined 
for only those time points where p<0.05. Error bars indicate standard error. 

Figs. 3 A and 3B are plots of the growth of challenge (wing web) 
tumors in TK chickens under conditions of (i) priming with plasmid pVSRCd 
and heterologous challenge with plasmid pcsrc527 (Fig. 3A: —a—, test; 
— a—, control) or (ii) priming with pcsrc527 and heterologous challenge with 
pVSRC-Cl (Fig. 3B: — O— , test; — f control). Test chickens were 
primed at 1 day posthatch with 100 jig of construct; test and control chickens 
were challenged at five weeks posthatch with 200 yg of construct. The mean 
challenge tumor diameter was computed as in Figs. 1A and IB. At each time 
point the ratio of chickens bearing palpable challenge tumors to total number of 
survivors to that point is indicated (standard typeface for control group, bold 
typeface for test group). Statistical comparisons were made between test and 
control groups at a particular time point as described for Figs. 2A and 2B. 
[*(p<0.05), **(p<0.01), ***(p< 0.001), for the student's t test], and the 
paired ratios are underlined for only those time points where, in the chi-squared 
test, p< 0.05. Error bars indicate standard error. 

20 Detailed Desc ription of the Invention 

A vaccination strategy is provided to prevent development of 
cancers. The vaccination method may be carried out on a subject at risk for a 
particular cancer, but before the development of the cancer. The practice of the 
invention may serve for the immunoprevention of prevalent human cancers, 

25 such as colon carcinoma, breast carcinoma, and various lymphomas whose 
progress is accompanied by the overexpression of a cellular proto-oncogene. 

The vaccination strategy of the present invention relies on the 
induction of an immune response that targets tumor cells by virtue of the 
recognition of the proto-oncogene-specific antigenicity. The aim of the vaccine 

30 protocol is to induce reactivity to self-determinants of an overexpressed proto- 
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oncogene product. The strategy exploits the structural relatedness between the 
product of the cellular proto-oncogene and that of the product of genes cognate 
to the target proto-oncogene. The cognate gene may comprise a wild-type or 
mutant cognate retroviral oncogene or a wild-type or mutant proto-oncogene 
5 of a species different from the host species. The starting point of the vaccine 
strategy is the high degree of primary sequence homology that exists between 
the protein product of a targeted proto-oncogene and that of its cognate 
retroviral oncogene, or between the proto-oncogene product and the product of 
a cognate proto-oncogene from a different species. However, in contrast to 
10 other proposed vaccine strategies, the present invention is not based on the 
immune recognition of a determinant defined by a cancer specific mutation. 

For those tumors showing proto-oncogene overexpression, this 
sequence homology permits application of the following strategy, which can be 
employed either prophylactically or therapeutically under conditions of cell- 
15 surface expression, or other forms of adjuvanicity, as chosen to enhance 
immunogenicity: (a) immunization of host biopsied cells with a DNA construct 
comprising a transgene cognate to the target proto-oncogene, which transgene 
encodes a gene product which induces host immunoreactivity to host self- 
determinants of the product of the target proto-oncogene; (b) return of the 
20 transfected cells to the body of the host to obtain expression of the transgene in 
the host, and thus immunity against the proto-oncogene product. The invention 
relies on the targeting of a self-determinant found on an overexpressed or 
overabundant proto-oncogene-encoded product. The foreign peptide elements 
of the immunizing oncogene product will trigger peripheral lymphocytes 
exhibiting a weak cross reactivity for the self peptides of the targeted proto- 
oncogene product. Although such self peptides would be present in normal cells 
expressing the proto-oncogene, targeting of the tumor cells is favored in view 
of their overexpression of the proto-oncogene. 

The immune strategy exploits the antigenicity of two alternative 
30 types of determinants: ( 1 ) tumor-associated antigenic determinant(s) induced as 
a consequence of the activity of the oncogene product, e.g., an enzymatic 
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modification of a cellular protein effected by the oncogene product, or (2) tumor 
associated antigenic determinant(s) intrinsic to the oncogene-encoded product 
itself. The difficulty in exploiting the first alternative by traditional means, i.e. , 
antigen purification, is that at present little or no systematic information exists 

5 bearing on the properties of an antigen that, though oncogene-induced, is not 
oncogene-encoded. This situation makes purification of any such antigen 
problematic. However, this problem is obviated from the outset by the present 
invention which utilizes biopsied cells which, as transfected in culture by the 
cognate retroviral oncogene, would express the relevant antigenicity. 

10 In terms of exploiting the second alternative, that of an 

antigenicity intrinsic to the proto-oncogene product, a relevant consideration is 
that the protocol of immunization according to the present invention primes the 
host to determinants of the oncogene product itself. A consequence of this 
immunization is induction of T-cell reactivity to the divergent, i.e foreign, 

15 peptide determinants of the retroviral oncogene product, i.e., those peptide 
determinants that show sequence differences with the positionally homologous 
determinants of the cellular proto-oncogene product. The induction of this 
reactivity does not in itself have vaccine potential, since the foreign 
determinants specific to the retroviral oncogene product are normally absent 

20 from the cellular proto-oncogene product. Nevertheless, the foreign peptide 
elements, notably those that differ by only a single amino acid from the 
positionally homologous self peptides, trigger peripheral T-lymphocytes 
exhibiting a weak cross-reactivity for the self peptides. Although such self 
peptides are present in normal cells expressing the proto-oncogene, targeting of 

25 the tumor cells is favored in view of their overexpression of the proto-oncogene. 

It is possible that many tumor-associated and overexpressed 
proto-oncogenes might possess mutations. In some cases, overexpression may 
very well arise as a direct consequence of one or more of the mutations. 
However, the present vaccination method does not have as its object the 

30 deliberate targeting of non-self determinants generated by proto-oncogene 
mutations. Unlike prior vaccination methods designed to target such mutation- 
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driven non-self determinants, it is the aim of the present invention to induce 
reactivity for self-determinants in the overexpressed product of tumor associated 
and overexpressed proto-oncogenes. 

Prior efforts attempting to elicit reactivity to proto-oncogene self 
5 determinants have relied on in vitro protocols utilizing mutant cell lines to 
identify individual self peptide immunogens (Disis et ai, Cancer Res. (1994) 
54:1071-1076; Peoples et al., Proc. Natl. Acad. Sci USA (1995), 92:432-436). 
According to the present invention, the host immune system is presented with 
the full array of naturally-derived class I binding peptides. The vaccine strategy 
10 of the present invention obviates the need for any a priori assessment of the 
immunogenic ity of individual peptides. 

While the cellular immunogens of the invention display self 
peptides, non-self peptides would also be presented which may serve as more 
effective tolerance breakers. The value of a non-self, but closely related to self. 
15 peptide is that it may more readily activate those T cells that have both a weak 
cross reactivity for the cognate self peptide and an activation threshold 
(determined by the tightness of binding to the T cell receptor) too high to be 
triggered by the self peptide. Moreover, cognate non-self is inductive of a good 
immune response, simply because it does in fact constitute nonself. The non- 
20 self immune response is expected to predispose the induction of the inevitably 
weaker response to the self determinants on the same protein product, since the 
resultant cytokine release provides local help to initiate the weaker anti-self 
response. 

As hereinafter exemplified in a model of J/c-oncogene-based 
25 tumor formation, immunization with cells transfected with a transgene construct 
expressing the v-src oncogene product induces reactivity to the product of the 
c-src proto-oncogene, thereby conferring protection against the growth of 
tumors displaying overexpression of the c-src proto-oncogene. 
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Target Proto-Oncopenes 

According to the present invention, patients with a family history 
of a cancer characterized by the overexpression of a particular proto-oncogene 
are selected for immunization. Alternatively, patients whose tumors can be 

5 shown to overexpress the proto-oncogene are selected. Overexpression of a 
proto-oncogene may derive from an increase over a basal level of transcription. 
Overexpression may also derive from gene amplification, that is, an increase in 
gene copy number, coupled with a basal or elevated level of transcription. 
Proto-oncogene overexpression may be assayed by conventional probing tech- 

10 niques, such as described in Molecular Cloning: A Laboratory Manual J. 
Sambrook et aL, eds., Cold Spring Harbor Laboratory Press, 2nd ed. 1989. 
The level of target proto-oncogene expression may be determined by probing 
total cellular RNA from patient cells with a complementary probe for the rele- 
vant mRNA. Total RNA from the patient cells is fractionated in a glyox- 

15 al/agarose gel, transferred to nylon and hybridized to an appropriately labelled 
nucleic acid probe for the target mRNA. The number of relevant mRNA tran- 
scripts found in the patient cells is compared to that found in cells taken from 
the same tissue of a normal control subject. 

As an alternative to measuring mRNA transcripts, the expression 

20 level of a target proto-oncogene may be assessed by assaying the amount of 
encoded protein which is formed. Western blotting is a standard protocol in 
routine use for the determination of protein levels. See Molecular Cloning, 
supra, Chapter 18, incorporated herein by reference. Accordingly, a cell lysate 
or other cell fraction containing protein is electrophoresed on a polyacrylamide 

25 gel, followed by protein transfer to nitrocellulose, and probing of the gel with 
an antibody specific for the protein in question. The probe step permits 
resolution of the desired protein from all other proteins in the starting mixture. 
The bound antibody may be prelabeled, e.g. , by a radioisotope such as 125 I, so 
as to permit its detection on the gel. Alternatively, a secondary reagent (usually 

30 an anti-immunoglobin or protein A) may be radiolabeled or covalently coupled 
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to an enzyme such as horseradish peroxidase or alkaline phosphatase. The 
strength of the signal is proportional to the amount of the target protein. The 
strength of the signal is compared with the signal from a sample analyzed in the 
same manner, but taken from normal as opposed to tumor tissue. 

A description of the methodology and use of Western blotting to 
determine the levels of the c-j/r-encoded protein pp60 c Jrr in adenomatous polyps 
(colonic epithelia) is provided by Cartwright et al. , Proc. Natl. Acad. Sci. USA 
(1990), 87:558-562, the entire disclosure of which is incorporated herein by 
reference. 

An at least about eight-fold increase in that gene's expression in 
the patient cells compared to expression in normal control cells from the same 
tissue would indicate candidacy for vaccination. 

Table 1 includes a partial list of representative proto-oncogenes, 
the overexpression of which has been associated with one or more malignancies. 
Each listed proto-oncogene is a target proto-oncogene according to the present 
invention. The corresponding oncogene, of which the target proto-oncogene is 
the normal cellular homoiog, is also identified. This list of target proto- 
oncogenes is intended to be representative, and not a complete list. 

Table 1 

Representative List of Target Proto-Oncog enes 

Proto- 

Oncogene Tumor Comments/References 

v-Akt is the oncogene of the AKT8 virus, which 
induces lymphomas in mice. 
1. Bellacosa et al., (1995) Int. J. Cancer 
64(4): 280-5; Southern-blot analysis has shown 
AKT-2 amplification in 12.1% of ovarian 
carcinomas, while Northern bot analysis has 



AKT-2 ovarian 

25 
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revealed overexpression of AKT-2 in 3 of 25 
fresh ovarian carcinomas which were negative for 
AKT-2 amplification. 

2. Cheng et ai. (1996) Proc. Natl. Acad. Set. 
USA 89(19): 9267-71): Amplification of AKT-2 
has been detected in 10% of pancreatic 
carcinomas. 

AKT-2 pancreatic Cheng et a/., (1996) Proc. Natl Acad. Sci. USA 

93(8): 3636-41: Amplification of AKT-2 has been 
detected in 10% of pancreatic carcinomas. 

c-erbB-2 bladder c-ErbB-2 is also known as HER2/neu. V-erbB is 

the oncogene of the avian erythroblastosis virus. 

1. Underwood et a/., (1995) Cancer Res. 
55(ll):2422-30: Protein overexpression was 
observed in 45% of patients with non-recurrent 
disease and 50% of patients with recurrent 
disease; 9% of bladder tumors analyzed shoed 
gene amplification. 

2. Coombs et a/., (1993) Pathology 169(1):35- 
42: c-ErbB-2 gene amplification was observed in 
14% of bladder tumors analyzed. 

3. Gardiner et al , (1992) Urolog. Res. 20(2): 17- 
20: Nineteen percent of primary transitional cell 
bladder carcinomas showed c-er6B-2 gene 
amplification. 

c-erbB-2 breast 1. Molina et al., (1966) Anticancer Research 

16(4B):2295-300: Abnormal c-erbB-2 levels were 
found in 9.2% of patients with locoregional breast 
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carcinoma, and in 45.4% of patients with 
advanced disease. 2. DePotter et al. n (1995) 
Virchows Arch. 426(2): 107- 15: Overexpression of 
the oncoprotein is observed in about 20% of 
invasive duct cell carcinomas of the breast. 3. 
Bandy opadhyay et al, (1994) Acta Oncol 
33(5):493-8: 35.4% of breast tumors showed c- 
erbB-2 overexpression; 17.4% showed gene 
amplification. 4. Fontana et ai, (1994) 
Anticancer Res. 14(5B):2099-104: 26% of 
samples showed c-erbB-2 amplification. 5. Press 
et ai, (1993) Cancer Research 53(20):4960-70: 
Amplified overexpression was identified in 38% 
of primary breast cancers. 6. Berns et al. % 
(1992) Cancer Res. 52(5): 1107-13: 23% of 
primary breast cancer tissues exhibited 
amplification. 7. Delvenne etal., (1992) Eur. J. 
of Cancer 28(2-3): 700-5: c-erbB-2 mRNA was 
overexpressed in 34% of breast tumor samples. 
8. Inglehart, (1990) Cancer Res. 50(20): 670 1-7: 
Two to thirty-two-fold gene amplification was 
found in multiple stages of tumor progression. 9. 
Slamon et ai, (1989) Science 244:707-12: A 
28% incidence of amplification of c-erbB-2 was 
found in 189 primary breast cancers. 10. Kraus 
et al. % (1987) EMBO J. 6(3):605-10: Eight cell 
lines demonstrated c-^B-2 mRNA levels ranging 
from 4 to 128- fold overexpression. 60% of all 
tumors analyzed showed elevated levels of c-erbB- 
2 mRNA. 
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c-erf>B-2 lung 1. Osaki et a/., (1995) Chest 108(1): 157-62: 

Lung tissue overexpression of c-erbB-2 was 
discovered in 42.5% of samples. 2. Lorenz et 
ai 9 (1994) Clin. Invest. 72(2): 156-63: A 64-fold 
increase in the amount of c-erbB-2 mRNA was 
observed; 33% of lung tumors showed 
overexpression of c-erbB-2. 

1. Katsaros et al. 9 (1995) Anticancer Res. 
15(4): 1501-10: Abnormally high expression of c- 
erbB-2 was found in 31% of tumor samples. 2. 
Felipe/ a/., (1995) Cancer 75(8):2147-52: 21.7% 
of ovarian tumors showed overexpression of c- 
erbB-2. 3. Fan et fl/., (1994) Chin. Med. J. 
107(8):589-93: c-erbB-2 amplification was found 
in 30.8% (8 of 26) of human ovarian cancers. 4. 
vanDam et aL 9 (1994) J. of Clin. Path. 
47(10):914-9: 24% of ovarian tumors showed c- 
erbB-2 overexpression. 5. Csokay et a/., (1993) 
Eur. J. of Surg. Oncology 19(6):593-9: c-erbB-2 
amplification was found in 34% of fresh ovarian 
tumor samples. 6. McKenzie et ai, (1993) 
Cancer 71(12):3942-5: 30% of ovarian tumor 
samples indicated c-erbB-2 overexpression. 7. 
Hung etal, (1992) Cancer Letters 6 1(2): 95- 103: 
A 100-fold c-erbB-2 overexpression was 
discovered in one human cell line. Two to four- 
fold amplification was also discovered. 

MDM-2 leukemia MDM-2 is the murine double minute-2 oncogene. 

1 . Bueso-Ramos et ai , (1993) Blood 82(9):2617- 



c-erbB-2 ovarian 
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23: 53% of cases showed overexpression of 
MDM-2 rnRNA. The level of MDM-2 mRNA 
overexpression in some cases of leukemias was 
comparable to that observed in some sarcomas, 
which demonstrate more than 50-fold MDM-2 
gene amplification. No evidence of gene 
amplification was observed. 2. Watanabe et al. , 
(1994) Blood 84(9):3158-65: 28% of patients 
with B-cell chronic lymphocytic leukemia or non- 
Hodgkin's lymphoma had 10-fold higher levels of 
MDM-2 gene expression. MDM-2 overexpression 
was found more frequently in patients at advanced 
clinical stages. 



c-myb colon V-myb is the oncogene of the avian 

myeloblastoma virus. 1 . Ramsay et al. , (1992) 
Cell Growth and Diff. 3(10):723-30: z-myb levels 
were always higher in colon cancer samples than 
normal tissue. 2. Alitalo et al, (1984) Proc. 
Natl. Acad. Sci. 81(14):4534-8: c-myb levels 
were always higher in colon cancer samples than 
normal tissue. 



c-myc breast V-myc is the oncogene of the avian myelocytoma 

virus. 1. Lonn et al. y (1995) Cancer 
75(1 1):2681-7: Amplification of c-myb occurs in 
16% of patients with breast cancer. 2. Hehir et 
al., (1993) J. of Surg. Oncology 54(4):207-9: c- 
myc overexpression was found in 60 % of breast 
carcinoma samples. 3. Kreipe et al. y (1993) 
Cancer Research 53(8): 1956-6 1 : Amplification of 
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c-myc was found in 52.6% of samples that 
displayed a Ki-Sl labelling index exceeding 30%. 
4. Watson et a/., (1993) J. Nat. Cancer Inst. 
85(ll):902-7: Amplification of c-myc occurs in 

5 up to 20 - 30% of breast cancers. 5. Berns et 

al. y (1992) Cancer Research 52(5): 1107-13: 
Amplification was found in 20% of primary breast 
cancer patients; the range was 3-14 gene copies. 
6. Watanabe et al y (1992) Cancer Research 

10 52( 1 9) :5 178-82: Expression of c-myc was 

increased by 10-fold. 



c-myc 



gastric/ 
colorectal 



15 



20 



25 



1. Rigas, (1990) Clin. Gastroent. 12(5):494-9: 
Overexpression of c-myc is found in 80 of colon 
cancers. 2. Erisman et al. y (1988) Oncogene 
2(4): 367-78: Adenocarcinoma cell lines express 
5-10-fold elevated levels of c-myc mRNA. Eight 
to thirty-seven-fold higher levels of c-myc protein 
was found in tumor cell lines compared to normal 
cells. 3. Sikora et al. y (1987) Cancer 
59(7): 1289-95: Up to 32-fold overexpression of 
c-myc mRNA was observed in 12 to 15 tumors. 
4. Tsuboi et al , (1987) Biochem. and Biophys. 
Res. Comm. 146(2):705-10: Gastric Cancer: A 
2-3-fold overexpression was observed in gastric 
cancer. A 2-10-fold overexpression was observed 



in colorectal cancer. 



c-myc lung 



1. Lorenz et al. v (1994) Clin. Invest. 72(2): 156- 
63: A 57-fold increase in c-myc mRNA levels 
was observed. 23% of samples indicated strong 
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expression of c-myc. 2. Kato et aL , (1993) Jap. 
J. of Cancer Res. 84(4):355-9: Liver tissue 
metastases from human small cell lung carcinoma 
revealed 30-fold amplification of c-myc. 



c-myc 



naso- 

pharn- 

geal 



Porter et aL , (1994) Acta Oto-Laryng. 1 14(1): 
1105-9: 22% of samples showed intense staining 
for c-myc. 



Q-mvc 



ovarian 



1. Bian et aL, (1995) Chin. J. of Ob. Gyn. 
30(7):406-9: 50% of samples showed 
amplification of c-myc. 2. Katsaros et aL, 
(1995) Anticancer Res. 15(4): 1501-10: 26% of 
samples exhibited c-myc amplification. 3. van 
Dam et aL, (1994) /. Clin. Path. 47(10):914-9: 
Overexpression of c-myc was found in 35% of 
ovarian carcinomas. 4. Xin et aL, (1993) Chin. 
./. of Ob. Gyn. 28(7):405-7: 54.5% of samples 
showed amplification of c-myc. 5. Tashiro et aL . 
(1992) Int. J. of Cancer 50(5):828-33: 
Overexpression was found in 63.5% of all serous 
adenocarcinoma tissues and 37.3% of all ovarian 
carcinoma tissues. Significant overexpression of 
c-myc was observed at Stage III compared with 
other stages. 



prostate Nag et aL, (1989) Prostate 15(2): 115-22: A 10- 
fold amplification of c-myc was observed. Fifty- 
fold higher levels of mRNA transcripts of c-myc 
were found. 
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Ras oncogenes were first recognized as the 
transforming genes of Harvey and Kirsten murine 
sarcoma viruses. Lorenz et al y (1994) Clin. 
Invest. 72(2): 156-63: a 13-fold increase in 
overexpression of c-Ki-rcu was observed. 18% of 
tumors displayed strong overexpression of c-Ki- 
ras. 



c-ras 



ovarian 



10 



15 



1. Katsaros et al< (1995) Anticancer Res. 
15(4): 1501-10: Higher levels of ras protein than 
in normal or benign ovarian tumors were found in 
45% of tumor samples. 2. vanDam et aL % 
(1994) J. of Clin. Path. 47(10):914-9: 20% of 
ovarian tumors exhibited c-ras overexpression. 
The levels of expression of c-ras were much 
higher in tumors of patients with recurrent or 
persistent disease after chemotherapy , than in the 
tumors of patients at initial presentation. 



c-src 



breast 



20 



W-src is the oncogene of the Rous sarcoma virus, 
which induces sarcomas in chickens. 
Muthuswamy et al y (1994) Mol and Cell Biol 
14(1) : 735-43 : c-er*B-2-induced mammary tumors 
possessed 6-8-fold higher c-src kinase activity than 
adjacent epithelium. 



c-src 



25 



colon/ 
colorectal 



1 . Cartwright et aL , (1994) J. of Clin. Invest. 
93(2):509-15: c-src activity is 6-10-fold higher in 
mildly dysplastic ulcerative colitis (a chromic 
inflammatory disease of the colon with a high on 
incidence of colon cancer) than in non-dysplastic 
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epithelia. This data suggests that activation of c- 
src is an early event in the genesis of UC colon 
cancer. 2. Talamonti et al., (1993) J. of Clin. 
Invest. 91(l):53-60: High level of c-src activity 
from colorectal cancer is found in liver 
metastases. 3. Termuhlen et al.< (1993) J. of 
Surg. Res. 54(4): 293-8: Colon carcinoma 
metastases to the liver had significantly increased 
activity of c-src with an average 2.2-fold increase. 
Extrahepatic colorectal metastases demonstrated an 
average 12.7-fold increase in c-src activity over 
normal mucosa. 



-yes colon V-yes is the oncogene of two avian sarcoma 

viruses, Esh sarcoma vims and Y73. 1. Pena et 
aL, (1995) Gastroent. 108(1): 117-24: Twelve to 
fourteen-fold higher expression of c-yes was found 
in colonic transforming oncogene adenomas 
compared to normal mucosa. Activity of c-yes 
was elevated in adenomas that are at greatest risk 
for developing cancer. 2. Park et al. % (1993) 
Oncogene 8(10): 2627-35: A ten to 20-fold higher 
than normal activity of c-yes was observed in 3 
out of 5 colon carcinoma cell lines. A 5-fold 
higher than normal activity was found in 10 out of 
21 primary colon cancers, compared to normal 
colonic cells. 
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Selection of Cognate Transgene for Preparation of Cellular Immunogen 

According to the present invention, a transgene construct is 
engineered comprising a transgene which is cognate to the target proto-oncogene 
(hereinafter "cognate transgene" or "CTG"). The transgene is selected such that 
5 it encodes a gene product which induces host immunoreactivity to host self- 
determinants of the product of the target proto-oncogene. The transgene should 
be expressed to very high levels in the transfectants. Thus, the construct should 
contain a strong promoter. 

The product encoded by the cognate gene must have a high 

10 degree of sequence homology with the product of the target proto-oncogene, but 
also must display some amino acid differences with the target proto-oncogene 
product. Thus, there must be a subset of one or more amino acid differences 
between the target proto-oncogene and its cognate in order to provide 
immunogenic stimulus. Two classes of genes that satisfy these criteria are 

15 retroviral oncogenes and xenogenic proto-oncogenes. The word "xenogenic" 
is intended to have its normal biological meaning, that is, a property or 
characteristic referring or relating to a different species. Thus, a xenogenic 
proto-oncogene is meant to include the a homologous proto-oncogene of a 
species other than the host organism species. It may be appreciated that in the 

20 case of a target proto-oncogene, e.g. MDM2, for which no retroviral homolog 
is yet known, a xenogenic homologue is advantageously utilized as the source 
of the DNA for the cognate transgene. 

In principle, a more effective immunogenic stimulus would 
depend on the particular sequence, and not on the distinction between a 

25 retroviral oncogene and a xenogenic proto-oncogene in terms of their relative 
transforming capacity. Thus, in certain cases, a retroviral oncogene may be 
better at providing a tolerance-breaking immunogenic stimulus, and in other 
cases, a xenogenic proto-oncogene may be more effective. 

The retroviral oncogene or xenogenic proto-oncogene DNA 

30 forming the CTG may comprise the wild type oncogene or proto-oncogene 
DNA. More preferably, a mutant DNA is utilized, which is engineered so as 
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to be non-transforming in the host. The DNA is mutated to include one or 
more nucleotide insertions, deletions or substitutions which will encode an 
oncogene product which is nontransforming in the host, but retains the requisite 
degree of sequence homology with respect to the target proto-oncogene. A 
5 cognate transgene deletion mutant (hereinafter "dCTG") is preferred. 

A protein sequence is generally considered "cognate" with respect 
to the target proto-oncogene-encoded protein if it is evolutionarily and 
functionally related between species. A more precise view of cognation is based 
upon the following sequence comparison carried out utilizing the FASTA 
3 program of Pearson and Lipman, Proc. Natl. Acad. Sci. USA (1988), 85:2444- 
2448, the entire disclosure of which is incorporated herein by reference. 
Cognation is attained upon satisfying two criteria imposed by FASTA; (i) 
alignment of segments corresponding to at least 75% of the target proto- 
oncogene's encoded amino acid sequence; (ii) at least 80% amino acid identity 
> within the aligned sequences. The segments of the target proto-oncogene 
protein sequence and protein test sequence satisfying the two criteria are 
referred to as "homology regions". Accordingly, at least 75% of the target 
proto-oncogene protein sequence is alignable with the test sequence. The 
ahgnable segments or homology regions may, however, represent less than 75% 
of the total test polypeptide chain for the case of test sequences that may 
significantly exceed the target proto-oncogene protein in length. 

One skilled in the art, armed with the FASTA program, may 
survey existing sequence data bases (either protein sequences or DNA 
sequences, insofar as the amino acid sequence is determined by FASTA for all 
reading frames) for test sequences which are cognate with respect to the target 
proto-oncogene. At the same time, one can isolate and then sequence what are 
very likely to be cognate test sequences (e.g. feline MDM-2, as likely to be 
cognate to human MDM-2) and use FASTA to verify the presumed cognation, 
according to the criteria set above. One may obtain the sequences of 
presumptive cognate proto-oncogenes from a large number of mammalian 
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sequences and screen these sequences with FASTA according to the aforesaid 

formulation of cognation. 

Because the product encoded by a CTG differs at a small number 
of amino acid positions from the product encoded by the target proto-oncogene, 

5 an immunogenic stimulus is provided that (i) is directed against the foreign 
protein and (ii) with a lower probability, induce an anti-self response. The 
CTG is selected such that the gene product will yield the greatest immunogenic 
stimulus to induce anti-self reactivity. Provided that overall sequence homology 
(preferably greater than about 75%) is maintained, the presence of scattered 

10 amino acid differences is desired, since any one residue would likely have a 
relatively low probability of inducing self-reactivity. Moreover, the greatest 
number of residue differences would be advantageous, consistent with 
maintaining the requisite degree of general sequence homology. 

The selection of amino acid modifications for the CTG may be 

15 facilitated by resort to available computer-based models used to identify 
immunogenic peptide fragments of polypeptides. These models could be 
employed to select CTGs which would possess the maximum number of 
immunogenic peptides for a given HLA haplotype. 

Screening Procedu re for CTG Selection 

20 Notwithstanding the availability of computer-based algorithms 

which have some predictive value, it is desirable to design CTGs with resort to 
a screening procedure based on an actual experimental assay that can be HLA- 
haplotype specific. Accordingly, cells are biopsied from a normal volunteer of 
particular haplotype. The cells are transfected with a CTG construct, preferably 

25 a dCTG construct, satisfying the criteria set for cognition. More preferably, the 
cells are transfected with multiple dCTGs, preferably at least five dCTGs, 
satisfying the criteria for cognition. The at least five dCTGs are selected to 
display amino acid differences that essentially extend throughout the polypeptide 
chains of the encoded sequences. The transfected cells are then used to 

30 immunize the volunteer in accordance with the immunization method of the 
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present invention. After immunization, the human subject is tested in a standard 
delayed hypersensitivity (DH) reaction with 10 4 -10 6 irradiated, autologous 
fibroblasts, as transfected with the same dCTG (or series of dCTGs) as used for 
the immunizing preparation. A positive DH reaction (induration) would verify 
5 the induction of reactivity. The induction of reactivity in this assay is readily 
demonstrable because of the priming to the non-self determinants on the dCTG- 
encoded protein and the readout in the DH reaction of the same nonself 
determinants. Once DH reactivity is demonstrated in a DH reaction that 
directly tests the antigenicity of the non-self determinants encoded by the dCTG 
10 (i.e., priming with a non-self construct, DH testing with the same non-self 
construct), the subject can be then tested in a DH reaction based on testing with 
the autologous cells transfected with a dCTG derived from the human proto- 
oncogene itself (i.e., priming with a non-self construct, testing with the human 
self construct). Testing of a battery of human volunteers will lead to a 
15 catalogue of HLA-matched dCTGs, such that, for individuals of the same HLA 
haplotype, the use of the particular dCTG would be inductive of reactivity to 
proto-oncogene-encoded self. Different CTGs may thus be tested so as to 
correlate maximal secondary stimulation with a particular HLA haplotype. 

At the same time, this procedure may be used with patients 
20 undergoing tumor resection (if post-operative immuno-suppressive protocols are 
not mandatory), such that prior to resection, a course of immunization would 
have been initiated, the endpoint of which would represent the development of 
a DH reaction. 

Any given amino acid difference between the CTG-encoded 
25 product and the proto-oncogene-encoded product has a low probability of being 
a " tolerance-breaker Thus, it is preferable to transfect the host cells with a 
mixture of multiple different CTGs, preferably dCTGs. The number of 
different dCTGs is preferably five or more. Moreover, it is preferred that, 
among themselves, the multiple dCTGs show amino acid differences that 
30 essentially extend throughout the polypeptide chains of the encoded sequences. 
The dCTGs would be selected to maximize amino acid differences and, at the 
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same time, make sure that differences are found all along the polypeptide chain. 
It would thus not be preferable to select a battery of deletions all from within 
the same domain of the polypeptide chain. 

According to a protocol which utilizes 10 7 irradiated cells for 
5 immunization containing five separate dCTGs, five groups of 2 X 10 6 cells are 
included in one inoculate, each group of 2 X 10 6 having been transfected with 
a separate dCTG from the total set of five CTGs that are cognate to a particular 
proto-oncogene. 

Selection of Non-Transforming Cognate Transgenes 

10 Non-transforming cognate transgene variants are most 

advantageously derived via deletion of a sequence essential for transformation. 
Unlike point mutations which are potentially reversible due to back mutations, 
deletion mutations are irreversible. Furthermore, deletion mutations do not 
possess the inherent disadvantage attaching to point mutations, namely, even 

15 though the requirement for generation of an acceptable cognate transgene is for 
a qualitative difference with the wild type, i.e., non- transforming versus 
transforming, any given point mutation may be neutral or else quantitative in 
its effect, that is, the mutation may reduce but not totally eliminate 
transformability. Thus, according to a preferred embodiment of the invention, 

20 a deletion is created in a region of the cognate transgene which encodes an 
amino acid sequence required for transformation. Consonant with non- 
transformability, the smallest deletion possible so as to leave intact the bulk of 
the antigenicity of the transgene product is selected. 

The engineering of a cognate transgene deletion mutant that 

25 satisfies these criteria is facilitated by reports of structure-function relationship 
in oncogene-encoded proteins. Such reports serve to identify regions of 
oncoproteins that are essential for transformation, as opposed to regions which 
are either neutral or serve merely to modulate transformability. Although such 
reports are usually based on in vitro transformation assays, and are therefore 

30 independent of immune effects, these studies can be exploited to aid in the 
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30 



construction of non-transforming dCTGs for use in the practice of the present 
invention. 

The deletion mutant is engineered to include at least a part of the 
region identified as critical for transformation. In those cases where essential 
amino acids have been identified, the deletion will span these residues. The 
engineering of any desired deletion can be readily accomplished by polymerase 
chain reaction (PCR) according to conventional PCR techniques, based upon the 
known nucleotide sequence of the unmutated cognate transgene. 

The following describes a representative protocol for deriving a 
non-transforming dCTG of the smallest possible deletion, for use in the practice 
of the present invention. A test dCTG, engineered on the basis of known or 
ascertained transformation-specific domains, and driven by the strongest possible 
promoter, is used to transfect murine 3T3 cells. A sister culture of 3T3 cells 
is also transfected, with non-deleted CTG. Each CTG or dCTG cell culture is 
15 inoculated into nude mice, in the absence of any treatment to render the cells 
non-dividing. Those dCTGs which do not yield tumors in the mice even after 
prolonged observation are then utilized as transgenes for the biopsied human 
cells which, upon transfcction with the transgene, will serve as a cellular 
vaccine according to the practice of the present invention. The dCTGs are 
selected with the smallest deletion mutant consonant with non-transformability. 

Some CTGs representing xenogenic proto-oncogenes may not be 
tumorigenic in the 3T3/nude mouse assay. For any such non-transforming 
CTG, it is not essential to generate a dCTG. However, even given non- 
tumorigenicity in nude mice, it may be desirable to opt for generation of a 
deletion mutant when the transgene is based upon a xenogenic proto-oncogene. 

In such cases, the deletion would be engineered so as to remove the 
homologous region to that deleted in the particular dCTG that corresponds to 
the deletion in the corresponding retroviral oncogene dCTG. 

Even though the transgene construct may comprise mutant 
oncogene or proto-oncogene DNA which is nontransforming, it is nevertheless 
preferable, as a safety measure, to treat the transfected cells to render them non- 
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dividing before inoculation back into the host. The cells are irradiated with a 
radiation dosage sufficient to render them non-dividing. 

Oncogenicity Assay of Cognate Transeenes 

As a further safety measure, the oncogenicity of a given dCTG 
5 is preferably thoroughly tested prior to infection of the human host cells which 
are used as cellular immunogens according to the practice of the present 
invention. For example, an oncogenicity testing regimen may take the form of 
three separate assays: (i) dCTG transfection of NIH 3T3 cells, followed by 
inoculation into nude mice; (ii) dCTG transfection of human fibroblasts, 

10 followed by inoculation into nude mice; and (iii) dCTG transfection of human 
fibroblasts, followed by an in vitro test of anchorage-dependent growth. In 
principle, all three should be negative to validate the use of any given dCTG in 
the vaccination method of the present invention. 

According to the oncogenicity assay (i), after stable transfection 

15 of NIH 3T3 cells with the test dCTG, the transfectants are inoculated into nude 
mice. Tumorigenicity of the transfectants in the mice is then evaluated 
according to standard protocols. 

According to oncogenicity assay (ii), human fibroblasts are 
transfected with the test dCTG as proposed in the above human immunization 

20 protocol. After stable dCTG transfection of human fibroblasts, however, rather 
than carrying out X-irradiation of the transfectants to render them non-dividing, 
followed by inoculation of the irradiated transfectants back into the human host, 
the transfectants are directly inoculated into nude mice as a direct test of 
tumorigenicity. Given the greater susceptibility of murine 3T3 cells to 

25 oncogenic transformation, vis a vis primary human or murine transfectants 
fibroblasts, assay (ii) is probably much less sensitive than assay (i), but does 
have the advantage of offering a direct test of dCTG oncogenicity in human 
cells. 

According to oncogenicity assay (iii), non-irradiated dCTG- 
30 transfected human fibroblasts are assayed for anchorage-dependent growth, i.e. 
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colony formation in soft agar, as a test of dCTG transforming potential in 
human cells. Anchorage independence, as defined by the ability of cells to 
grow when suspended in semisolid medium, is a common phenotype acquired 
by human tumor cells, particularly those tumor cells of mesenchymal origin, 
5 such as fibrosarcomas. While assay (iii) has no in vivo readout, it offers an 
independent test of the critical issue of dCTG oncogenicity in human cells. 

The oncogenicity assays are performed according to published 
protocols. Assay (i), comprising dCTG transfection of NIH 3T3 cells followed 
by inoculation into nude mice, may be performed according to the protocol of 
10 Stevens et aL, Proc. Natl. Acad. ScL USA (1988), 85:3875-3879, including 
DNA transfection by the calcium phosphate coprecipitation method of 
Manohavene/a/., Carcinogenesis (1985), 6:1295-1301. Accordingly, NIH 3T3 
cells (7.5 X 10 5 cells per 100-mm dish) are exposed to a calcium phosphate- 
DNA coprecipitate (40 /xg of genomic DNA plus 3 /ig of pS V2neo per dish) for 
15 4 hours. Two days later, each dish is trypsinized and reseeded into a 175-cnr 
flask. For the next 10 days, cultures are selected in G418 (400 /*g/ml), and the 
flasks are then trypsinized and cells are replated in the same flask to disperse 
the G418-resistant colonies into a diffuse lawn of cells. Two days later, the 
cells are harvested and washed with serum-free medium prior to injection. One 
20 injection of 5 X 10 6 cells into the right flank and one injection of 1 X 10 7 cells 
into the left flank, each in a volume of 200 pi are done on each nude mouse. 
Injection sites are monitored at 3- or 4-day intervals for 100 days. The sites are 
scored for the number of tumors induced per injection site. 

Oncogenicity assay (ii), whereby dCTG transfection of human 
25 fibroblasts followed by inoculation into nude mice, is carried out in the same 
manner as assay (i) except that for assay (ii) the human fibroblast transfectants 
are substituted for the murine 3T3 transfectants. 

Assay (iii), involves a test of the in vitro anchorage-dependent 
growth of dCTG-transfected human fibroblasts The assay is carried out as 
10 described in Stevens et aL, J. Cancer Res. and Clin. Oncol. 1989, 115:118- 
128. 1 x 10 5 cells are seeded per 60-mm dish into 0.33% Noble agar over a 
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6-ml 0.5% agar base layer in Hams F10 supplemented with 6% fetal bovine 
serum. A portion of the agar suspension is diluted with Hams F10 plus 6% 
fetal calf serum to 200 cells/5 ml to determine the cloning efficiency of these 
cells when seeded into plastic 60-mm dishes. Agar dishes are fed with 1 ml 
5 Hams F10 supplemented with 6% fetal bovine serum on the 1st and 15th day 
after seeding. Four weeks after seeding, all agar colonies > 75 /xm in diameter 
are counted and the colony counts are normalized to the plating efficiencies 
which aliquots of the initially seeded cells showed on plastic. This comparison, 
or normalization, of the agar colony counts to the plastic dish colony counts is 
10 useful in identifying and correcting for any mechanical artifacts which might 
result from the seeding into agar of dead cells that had persisted from the initial 
transfection treatment or from heat-induced cell death, which might have 
occurred while suspending cells in molten agar during the process of seeding the 
agar dishes. 

15 The following is a partial list of various deletions which, based 

upon published accounts of experiments with human or animal cells, are 
believed to render the identified CTG non-tumor igenic. 
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Eneineerin g of Vectors for Host Cell Transfection 

The engineering of vectors for expression of a particular CTG, 
preferably a dCTG, is based on standard methods of recombinant DNA 
technology, i.e. insertion of the dCTG via the polylinker of standard or 
commercially available expression vectors. The dCTG is operably linked to a 
strong promoter. Generally speaking, a "strong" promoter is a promoter which 
achieves constitutively high expression of the dCTG in the transfected cells. 
Each promoter should include all of the signals necessary for initiating 
transcription of the relevant downstream sequence. These conditions are 
fulfilled, for example, by the pBK-CMV expression vector available from 
Stratagene Cloning Systems, La Jolla. CA (catalog no. 212209). The pBK- 



WO 97/25860 



PCMJS97/00582 



- 38 - 

CMV vector contains the cytomegalovirus (CMV) immediate early promoter. 
dCTGs xenogenic with respect to a particular target proto-oncogene may be 
isolated by conventional nucleic acid probing techniques, given the availability 
of a highly homologous probe represented by the cognate retroviral oncogene 
5 and/or the human proto-oncogene itself. 

Collection of Host Cells for Transfection 

The host cells which may be transfected to derive the cellular 
immunogens of the present invention must express class I MHC and be 
susceptible to isolation and culture. Fibroblasts express class I MHC and may 
be cultured. Accordingly, punch biopsies of host human skin are performed to 
harvest fibroblasts. Punch biopsies can be performed by a competent physician 
as a standard clinical procedure. Each biopsy yields a starting population of 1-2 
X 10 7 cells that would proliferate in culture. Methods for the preparation of 
tissue cultures of human fibroblasts are well developed and widely used. See, 
Cristofalo and Carpenter, /. Tissue Culture Methods (1980), 6:117-121, the 
entire disclosure of which is incorporated herein by reference. Essentially, skin 
obtained by punch biopsy is washed using an appropriate wash medium, finely 
minced and cultured in a suitable culture medium, such as Dulbecco's Modified 
Eagle Medium (DMEM), under C0 2 at 37 °C. The cells are trypsinized with 
a trypsin solution and transferred to a larger vessel and incubated at 37 °C in 
culture fluid. 

Host Cell Transfection 

The expression vector carrying the dCTG is used to transfect 
biopsied host cells according to conventional transfection methods. One method 
25 of transfection involves the addition of DEAE-dextran to increase the uptake of 
the naked DNA molecules by a recipient cell. See McCutchin and Pagano, J. 
Natl. Cancer Inst. (1968) 41:351-7. Another method of transfection is the 
calcium phosphate precipitation technique which depends upon the addition of 
Ca ++ to a phosphate-containing DNA solution. The resulting precipitate 
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apparently includes DNA in association with calcium phosphate crystals. These 
crystals settle onto a cell monolayer; the resulting apposition of crystals and cell 
surface appears to lead to uptake of the DNA. A small proportion of the DNA 
taken up becomes expressed in a transfectant, as well as in its clonal descen- 
5 dants. See Graham et al.. Virology (1973), 52:456-467 and Virology (1974), 
54:536-539. 

Preferably, transfection is carried out by cationic phospholipid- 
mediated delivery. In particular, polycationic liposomes can be formed from 
N-tl-(2,3-dioleyloxy)propyl]-N,N,N-trimethylammonium chloride (DOTMA) 

10 or related liposome-forming materials. See Feigner et al., Proc. Natl. Acad. 
Sci. USA (1987) 84:7413-7417 (DNA-transfection); Malone et al., Proc. Natl. 
Acad. Sci. USA (1989), 86:6077-6081) (RNA-transfection). One preferred 
technique utilizes the LipofectAMINE™ Reagent (Cat. No. 18324-012, Life 
Technologies, Inc., Gaithersburg, MD) which is a 3:1 (w/w) liposome 

15 formulation of the polycationic lipid 2,3-dioleyloxy-N- 
[2(sperminecarboxamido)ethyl-N, N-dimethy I- 1 -propanaminium trifluoroacetate 
(DOSPA) (Chemical Abstracts Registry name: N-[2-((2,5-bis[(3- 

aminopropyl)amino]-l-oxypentyl}amino)ethyl]-N,N-dimethyl-2,3-bis(9- 
octadecenyloxy)-l -propanaminium trifluoroacetate), and the neutral lipid 
20 dioleoyl phosphatidylethanolamine (DOPE) in membrane filtered water. 
Transfection utilizing the LipofectAMINE™ Reagent is carried out according to 
the manufacturer's published protocol. The protocol (for Cat. No. 18324-012) 
provides for either transient or stable transfection, as desired. 

The advantage of transient expression is its rapidity, i.e. there is 
25 no requirement for cellular proliferation to select for stable integration events. 
This rapidity could conceivably be of major clinical importance, in cases of an 
already metastatic tumor burden, wherein the weeks required for selection of 
stable transfectants may simply not be available to the clinician. 

There are, nonetheless, two general disadvantages to the use of 
10 transient transfection. The first is that expression usually peters out after a few 
days, in contrast to the continual expression in the case of stable transfection. 
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This is not particularly crippling in terms of our immunization protocol. The 
inoculated, irradiated cells used for immunization would likely not survive in 
vivo for more than 4 or 5 days, in any case. Thus the nominal advantage 
accruing to stable transfection, that of a long-duration expression by the progeny 
5 of the parental inoculated cell, is not of particular relevance in the case of the 
immunizing regime described herein, which is based on the use of non-dividing, 
probably short-lived cells. 

A second disadvantage of transient transfection resides in the fact 
that it yields a cell population, only a subset of which has actually been 

10 transfected and thus expresses the protein encoded by the transgene. This 
problem is obviated in the case of stable transfection, wherein over time one can 
develop a pure population of transfectants via selection for a resistance marker, 
such as neo, under conditions of clonal proliferation of the initial stable 
transfectants, i.e. daughter cells of transiently transfected cells lack the 

15 transgene, in contrast to the case with stable transfectants. In the situation 
where there is sufficient time to effect immunization based on stably transfected 
cells, the progeny of all transfected clones would be utilized, not just the 
progeny of a single clone, as is sometimes done for detailed biochemical and 
molecular analyses of gene expression. Clearly the more clones utilized, the 

20 more quickly one can arrive at the requisite number of cells to be used for 
immunization. 

Percentage of Cells Exhibiting dCTG Expression 

The percentage of cells exhibiting dCTG expression may be 
determined by an immunohistology assay. In this procedure, a small number 

25 of cells ( ~ 500) from the harvested pellet following centrifugation of transfected 
cells are deposited on a cover slip and fixed with cold acetone. At this point, 
a standard immunohistological assay is carried out with the cells on the cover 
slip, i.e. addition of a primary monoclonal antibody reactive to the dCTG- 
encoded protein, followed by the addition of a developing antibody, e.g. a 

30 fluorescent tagged antibody reactive to the primary monoclonal antibody. 
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Measurement of the percentage of cells scoring as dCTG-positive in the 
fluorescent assay allows a determination of the number of positive transfectants 
in the starting culture, and thus the number of total cells to be used for 
immunization to arrive at the desired number of dCTG-positive cells to be 
5 inoculated in the patient. 

If, as would be almost certain, the percentage of cells scoring as 
dCTG-positive is less than one hundred percent, one can simply increase the 
number of cells to be used for immunization, so as to include the desired 
number of transfectants. The non-transfected cells in the immunizing population 
0 would simply represent x-irradiated, autologous fibroblasts that would constitute 
no danger to the patient. 

Transfectant Irradiation 

Prior to return to the host, the transfected cells are preferably 
irradiated. The transfectants are irradiated with a radiation dose sufficient to 
> render them non-dividing, such as a dose of 25 By or 2500R. The cells are 
then counted by trypan blue exclusion, and about 2 X 10 7 irradiated 
transfectants are resuspended in a volume of 0.2-0.4 ml of Hanks Balanced Salt 
Solution. 



Vaccination Procedure 

The transfected cells are returned to the host to achieve 
vaccination. The cells may be reimplanted at the same body site from which 
they were originally harvested, or may be restored to a different site. 

It is the object of the present invention to generate a systemic 
tumor immune response, so as to fight metastasis formation wherever any 
metastases are found. Accordingly, there is no reason to inject the transfected 
cells at the same body site from which they were taken. Intramuscular or 
subcutaneous inoculation at a distal site would suffice to yield a systemic 
response. Thus, patients are preferably vaccinated by subcutaneous inoculation 
of the transfected cells. 
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For s-crc overexpression associated with colon carcinoma, partial 
venous inoculation is preferred, as the liver is a frequent site of metastases. For 
vaccinating against breast cancers and lymphomas, systemic immunization is 
preferred. 

5 As a general rule, it is desirable to generate the strongest immune 

response consistent with clinical monitoring of no adverse side effects, i.e. 
multiple rounds of inoculation with, for example 10 7 cells, at each round. The 
number of rounds of inoculation is selected accordingly. The efficacy of the 
inoculation schedule may be monitored by a delayed hypersensitivity reaction 

10 administered to the patient. A course of about up to 10 inoculations, at 2-3 
week intervals, may be utilized. It may be appreciated that the inoculation 
schedule may be modified in view of the immunologic response of the 
individual patient, as determined with resort to the delayed-type hypersensitivity 
(DTH) reaction. 

15 Patient Response Monitoring bv Delaved-tvpe Hy persensitivity Reaction 

Patients are assessed for reactivity to the irradiated transfectants 
by a test of skin reactivity in a DTH reaction, DTH has been used clinically 
(Chang et al (1993), Cancer Research 53:1043-1050). To measure reactivity 
to the autologous irradiated transfectants, 10 4 - 10 6 cells in a volume of 0.1 ml 

20 Hanks buffered saline solution (HBSS) are inoculated intradermal^ into the 
host. Induration is measured 48 hours later, as an average of two perpendicular 
diameters (responses of greater than >2 mm is considered positive). 

One advantage to the DTH assay is that it can independently 
assess the induction of T cell reactivity to (i) the transfectants used for 

25 immunization {i.e. the set of 5 or more dCTGs chosen for immunization 
purposes, each containing non-self determinants) and (ii) transfectants, as 
transfected with the human dCTG itself containing only self determinants. 
Thus, the induction of reactivity to the transfectants used for immunization 
establishes that the immunizing transfectants are in fact immunogenic, that is, 

30 the patient has not exhibiting a much weakened capacity for immune response. 
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If the patient is demonstrably capable of response to the immunizing 
transfectants, then skin testing with the dCTG (human) transfectants would 
establish whether or not reactivity to the human proto-oncogene encoded product 
had been induced. According to the practice of the invention, inoculation of the 
immunizing transfectants would continue for at least as long as the induction of 
reactivity to the human proto-oncogene-encoded protein occurs. 

The practice of the invention is illustrated by the following 
nonlimiting examples. 



Example 1 

0 Immunization of Chick ens Against c-srrtSlTi-Indnceti 

Tumors Bv Vaccination with v-src DNA 

A. Genes 

The oncogene c-j/r(527) is an activated form of chicken c-src. 
Its protein product ppoO^ 5271 differs from the protein product of c-src, pp60 c 
> src , by only a single amino acid substitution, phenylalanine for tyrosine at 
residue 527 (Kmiecik and Shalloway, (1987) Cell 49, 65-73). This substitution 
eliminates the negative regulatory influence exerted on ppoO"" phosphokinase 
activity by the enzymatic phosphorylation of the position 527 tyrosine. The 
protein product of v-src, pp60 v sre , shows a number of sequence differences with 
pp60< sre (Takeya and Hanafusa, (1983) Cell 32, 881-890), including scattered 
single amino acid substitutions within the first 514 residues and a novel C 
terminus of 12 amino acids (residues 515-526), in place of the nineteen C 
terminal amino acids of p P 60" rc (residues 515-533). Both the v-jrc-positive 
plasmid, pMvsrc. and the c-jrc(527)-positive plasmid, pc*rc527, were originally 
shown (Kmiecik and Shalloway, (1987) Cell 49, 65-73) to transform murine 
NIH 3T3 cells in culture. However, the v-j/r- induced transformants exhibited 
a more rapid or more extensive colony growth in soft agarose than the c- 
src(527)-induced transformants, as well as a usually shorter latency of tumor 
formation in nude mice (id.). 
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B. Plasmids 

1. pvSRC-Cl 

The pVSRC-Cl plasmid was prepared as described by Halpern 
et al. , (1991) Virology 180, 857-86. Essentially, the plasmid was derived from 

5 the pRU-src plasmid (Halpern et al., (1990) Virology 175, 328-331) by 
subcloning the v-src(+) XhohEcoRl fragment of the latter into the multiple 
cloning sequence of pSP65 (Melton et al , (1984) Nucleic Acids Res. 12, 7035- 
7056) which had been cleaved with Sail and EcoRl; since ligation of the Xhol 
overhang at the Sail site destroys both recognition sequences, subsequent 

10 removal of the v-src( + ) insert from the vector was achieved by digestion with 
EcoKl and with //mdlll, which cleaves at a position in the multiple cloning 
sequence adjacent to the Sail site. The pVSRC-Cl plasmid was restricted with 
EcoHl and Hindlll, so as to liberate the tumorigenic insert. This insert included 
the \-src oncogene of the subgroup A strain of Prague RSV, as flanked 

15 downstream by a portion of the long terminal repeat (LTR) of RSV (from the 
5' start of the LTR, to the single Ea?Rl site). 

2. pMvsrc 

The pMvsrc plasmid was generously provided by Dr. David 
Shalloway, Cornell University, Ithaca, NY. The plasmid is prepared according 

20 to Johnson et al, (1985) Mol. Cell. Biol. 5, 1073-1083. Briefly, the 3.1-kb 
BamUVBglW Schmidt Ruppin A v-src fragment from plasmid pN4 (Iba et al. , 
(1984) Proc. Nat. Acad. Sci. USA 81, 4424-4428) is inserted into the pEVX 
plasmid (Kriegler et al., (1984) Cell 38,483-491) at a Bgtll site lying between 
two Moloney murine leukemia virus (MoMLV) long terminal repeats (LTRs). 

25 This fragment contains 276 bp of pBR322 DNA from the pBR322 BamHl to 
Sail sites followed by 2.8 kb of Rous sarcoma virus (RSV) DNA from the Sail 
site that is about 750 bp upstream of the env termination codon down to the 
Nrul site that is about 90 bp downstream of the \-src termination codon. (The 
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Nrul site is converted to a BglW site in the construction of pN4.) Ligation is 
performed by using a 10:1 insert- vector DNA fragment molar ratio. 

The pMvsrc plasmid was restricted with Nhel, so as to liberate 
a tumorigenic fragment. The fragment included the \-src oncogene of the 
subgroup A strain of Schmidt-Ruppin RSV, as flanked upstream by most of the 
Moloney murine leukemia virus (MoMLV) LTR (from the Nhel site near the 
5' start of the LTR, to the 3' end of this LTR) and downstream by a small 
portion of the MoMLV LTR (from the 5' start to the Nhel site). 

3. pcsrc527 

The pcsrc527 plasmid is prepared according to Kmiecik and 
Shalloway, (1987) Cell 49, 65-73. Briefly, a plasmid is constructed by cleaving 
expression vector pEVX (Kriegler et al., (1984) Cell 38,483-491 at its unique 
BgUl site lying between two MoMLV LTRs and inserting the 3.2 kilobase (kb) 
pair Bamm-Bglll hybrid src fragment from plasmid pHB5 in the proper 
orientation. This fragment contains sequences from pBR322, the SRA env 3' 
region, SRA v-src, src from recovered ASV, and chicken c-src. The fi^III site 
is generated by insertion of a linker at the Sad site about 20 bp downstream 
from the c-src termination codon. The restriction map of pMHB5 contains the 
MoMLV splice donor about 60 bp downstream from the 3 'end of the upstream 
LTR and the v-src splice acceptor about 75 bp upstream from the src ATG. 

Plasmid pMHB5527 is constructed by inserting the synthetic 
double-stranded DNA oligomer 

5' CCAGTTCCAGCCTGGAGAGAACCTATA (SEQ ID NO : 1 ) 3. 

3' TCGGGGTCAAGGTCGGACCTCTCTTGGATATCTAG (SEQ ID NO: 2) 5' 

into P MHB5 between the Banll site at c-src codon 524 and the downstream 
unique 5^111 site. This alters the TAC Tyr 527 codon to a TTC Phe codon 
while preserving the remaining c-src coding region. Equimolar amounts of the 
double-stranded oligomer and three gel-purified tandem restriction fragments 
from pMHB5 are ligated in one reaction, which contains the following: the 
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oligomer with Barill and flglll complementary ends, the 3 kb Bglll-Bgll (Bgll 
in the pEVX ampicillin resistance gene) partial digest fragment, the adjacent 6. 1 
kb BglhBgll (downstream Bg/7 in c-src) fragment, and the 0.38 kb Bg\\-Ban\\ 
(BanW at c-src codon 524) fragment. 

5 Plasmid pcsrc527 is constructed by replacing the 2 kb Sail (in 

env)-Mlu\ (in c-src) fragment in plasmid pMHB5527, with the homologous 
fragment from plasmid p5H. This fragment contains the coding sequence for 
the c-src amino region (codons 1 to 257) that have been isolated by molecular 
cloning of a c-src provirus and previously shown by sequencing to contain 

10 authentic c-src sequence without the mutation at codon 63 (Levy el al , (1986) 
Proc. Natl Acad. Sci. USA 83, 4228-4232). Equimolar amounts of 
complementary gel-purified SaR-Mlu\ fragments from p5H and the other 

plasmids are ligated. 

The pcsrc527 plasmid was restricted with Nhel y so as to liberate 
15 a tumorigenic fragment. The tumorigenic fragment included the c-src(521) 
oncogene, as flanked by the same LTR complement as in pMvsrc. 

C. Animals 

Chickens of two closed lines, SC and TK, were utilized. These 

lines differ at the major histocompatibility (B) complex (Bt/B? for the SC line, 
20 B^/B 11 for the TK line). Embryonated eggs were obtained from Hyline 
International (Dallas Center, IA). All chickens were hatched at the University 
of New Hampshire Poultry Research Farm and housed in isolation. 

D. Tumor Induction bv Plasmid DNA 

Tumors were induced by subcutaneous inoculation in the wing 
25 web of a jrc-positive plasmid according to the technique described by Fung et 
al (1983) Proc. Natl Acad. Scl USA 80, 353-357 and Halpem etal, (1990) 
Virology 175, 328-331. Of the three tumorigenic plasmids utilized here, all 
were adjusted, prior to inoculation, to a concentration of 100 /*g of enzyme- 
restricted DNA per 100 /il of phosphate-buffered saline. The conditions of 
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inoculation used for particular experiments (age of chicken at time of 
inoculation, amount of plasmid, etc.) are indicated below. 

E Growth of Primary f win P web> Tumors i n TK or SC Chiclcms 

Inoculated with oVSRC -Cl. nMv.crr or ncsreSTI 
Individual 1 -day-old chickens of line TK or of line SC were 
inoculated with 100 ig of either pVSRC-Cl, pM vsrc or pcjrc527. The mean 
tumor diameter (mm) at a particular time point and for any one group of TK or 
SC line chickens inoculated with an individual src-positive construct was 
computed as the sum of the diameters of the primary tumors divided by the 
number of chickens surviving to that point. The results are shown in Fig. 1 A 
(line TK) and Fig. IB (line SC). The ratios at each time point show, for a 
particular group, the number of chickens bearing palpable tumors to the total 
number of survivors to that point (standard typeface for pcjrc527, italics for 
pVSRC-Cl, bold typeface for pM Vsrc). Error bars (unless obscured by the 
symbol) indicate standard error. 



F - Growth of Challenge (win g weh> Tumors in Test and rnntml 

Line TK Chickens I Inde x Conditions of Priming an d Homolog ous 
Challenge with ncsrcSll. or Priming and Homologous Challpng P 
with oVSRC-Cl 

Growth of challenge (wing web) tumors in test and control line 
TK chickens was determined under conditions of (i) priming and homologous 
challenge with p«/r527, or (ii) priming and homologous challenge with 
pVSRC-Cl. Test chickens were primed at 1 day posthatch with 100 Ig of 
construct; test and control chickens were challenged at five weeks posthatch 
with 200 Ig of construct. The mean challenge tumor diameter was computed 
as described in the preceding section. At each time point the ratio of 
chickens bearing palpable challenge tumors to total number of survivors to 
that point is indicated for priming and homologous challenge with pcsrc527 
(Fig. 2A) and priming and homologous challenge with pVSRC-Cl (Fig. 2B) 
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(standard typeface for control group, bold typeface for test group). The 
statistical comparison between the mean challenge tumor diameters of the test 
versus the control group at a particular time point was made using a two-tailed 
student's t test, *(p<0.05), **(p<0.01), ***(p<0.001). The statistical 
5 comparison between the ratios of chickens bearing palpable challenge tumors to 
total number of survivors of the test versus the control group at a particular 
time point was made using a chi-squared test; the paired ratios are underlined 
for only those time points where p<0.05. Error bars indicate standard error. 



G. Growth of Challenge (wing webl Tumors in Test and Control 

10 line TK chickens under Conditions of Priming with pVSRC-Cl 

and Heterologous Challenge with pcsrc527, or Priming with 
pcsrc527 and Heterologous Challenge with pVSRC-Cl 
Growth of challenge (wing web) tumors in test and control line 
TK chickens, was determined under conditions of (i) priming with pVSRC-Cl 

15 and heterologous challenge with pcjrc527, or (ii) priming with pcsrc527 and 
heterologous challenge with pVSRC-Cl. Test chickens were primed at 1 day 
posthatch with 100 j*g of construct; test and control chickens were challenged 
at five weeks posthatch with 200 /xg of construct. The mean challenge tumor 
diameter was computed as described in Section E. At each time point the ratio 

20 of chickens bearing palpable challenge tumors to total number of survivors to 
that point is indicated for priming with pVSRC-Cl and heterologous challenge 
with pcsrc527 (Fig. 3 A) and priming with pcsrc527 and heterologous challenge 
with pVSRC-Cl (Fig. 3B) (standard typeface for control group, bold typeface 
for test group). Statistical comparisons were made between test and control 

25 groups at a particular time point as described in the preceding section 
[*(p<0.05), **(p<0.01), ***(p< 0.001), for the student's t test], and the 
paired ratios are underlined for only those time points where, in the chi-squared 
test, p<0.05. Error bars indicate standard error. 
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H. Discussion 

In a direct comparison of the growth of tumors induced in line 
TK by either pMvsrc or pVSRC-Cl, a similar pattern of relatively rapid 
regression was observed. This result established that the difference in LTR 
complement between these two v-src positive constructs did not exert a major 
influence on the tumor growth pattern in the TK line (Fig. 1A). By contrast, 
much more extensive and persistent tumor growth resulted from inoculation of 
TK chickens with the parc527 construct (Fig. 1A). The relatively greater 
growth capacity of tumors induced by this construct indicated that in the TK 
line, the c-src(527) oncogene is much more highly tumorigenic than the v-src 
oncogene. This difference did not, however, generalize to the SC line (Fig. 
IB). The SC line was chosen for comparison with the TK line on the basis of 
earlier observations (Halpern et al., (1993) Virology 197, 480-484) that v-src 
DNA-induced tumors engender a much weaker tumor immune response in line 
SC than in line TK. Whereas the growth of pwrc527-induced primary tumors 
was virtually indistinguishable in the two lines, the growth of the v-src-induced 
tumors was considerably greater in the SC than in the TK line (Figs. 1A and 
IB). Thus v-src, but not c-*r C (527), gives rise to primary tumors whose growth 
patterns differ in the two lines analyzed here. 

Only minimal protection against homologous challenge was 
observed under conditions of priming to c-src(527) DNA, indicative of the 
induction of a relatively weak tumor immune response (Fig. 2A; a statistically 
significant lowering of challenge tumor growth in the test versus the control 
chickens was observed at only one time point). By contrast, the v-src DNA- 
primed chickens showed excellent protection against the homologous tumor 
challenge (Fig. 2B). 

Priming with v-src DNA engenders a relatively greater degree 
of protection against challenge with c-jrc(527) DNA, than that afforded by 
priming with c-,rc(527) DNA itself (Fig. 3A). The degree of protection was 
weaker than that determined (Fig. 2B) for the case of priming and 
homologous challenge with v-src DNA. Only marginal protection was 
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observed, however, when the heterologous challenge protocol was carried out 
in the reverse order (Fig. 3B). These results demonstrate that induction of 
reactivity to an antigenicity specified in tumor cells by an overexpressed proto- 
oncogene can confers tumor immunity. 

Example 2 
Vaccination Protocol 

The following is a representative vaccination protocol according 
to the present invention. 

A. Skin Punch Biopsy 

A punch biopsy of skin is obtained by a trained physician 
following standard medical practice. 

B. Preparation of Primary Fibroblast Culture 

Under sterile conditions, the skin obtained by punch biopsy is put 
in a tube with 10 ml of the following wash medium: Dulbecco's Modified 
Eagle Medium (DMEM), containing sodium bicarbonate (30 ml/liter of a 5 .6% 
solution) and penicillin/streptomycin (2 ml/liter of a pen-strep stock solution 
containing 5000 units penicillin and 5000 /xg of streptomycin/ml, pH 7.2-7.4.). 
In a sterile hood, the skin biopsy is added to a Petri dish, and then transferred 
several times to new Petri dishes containing the same wash medium. The 
biopsy is then finely minced with two scalpels, and 2-4 pieces ( < 1 mm 3 ) of the 
minced biopsied are placed in the middle part of one or more T25 flasks . The 
flask is placed in a tissue culture incubator at 37 °C for one half hour with the 
cap firmly closed, then opened for 10 minutes. The following culture medium 
is prepared: DMEM containing sodium bicarbonate; antibiotics; and 10% fetal 
calf serum containing 2.5 /xg/ml fungizone, 40 /Ag/ml gentamicin, and 1% 
glutamine( 3 % W/V). Two ml of the culture medium is then added to the flask, 
and the flask is incubated at 37°C (5% C0 2 ), with the cap lightly unscrewed. 
The flask is left for three days without moving so as to obtain adhesion of the 
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separate pieces of skin to the plastic. Afterwards, the medium is changed two 
times per week over a 3-4 week period always adding 2-3 ml of medium. To 
trypsinize the skin cell culture, one needs zones of confluence. After aspirating 
the culture medium, 5 ml of the Puck's Saline A/EDTA solution (0.4 g EDTA 
5 to 1 liter of Puck's Solution A) is added and immediately aspirated. Then 1 ml 
of trypsin solution (0.05/0.02% trypsin in PBS, without Ca++ or Mg++) is 
added and incubated for 5 min at 37°C, at which time 2 ml of culture fluid is 
added to stop the action of the trypsin. The cells are then transferred to a larger 
flask (T75) and incubated at 37°C in 15 ml of culture fluid, which is changed 
10 every 2 days. 

C Fibroblast Transfection 

The fibroblasts (2 X 10 s cells) are washed twice in DMEM 
without serum or antibiotics. A LipofectAMINE™-DNA solution is prepared 
by mixing in tube #1 mix 400/d DMEM and 10/xl of dCTG vector DNA 
15 (1/ig/ul). In tube #2 , 400 /tl DMEM and 25 Ml of LipofectAMINE Reagent 
(Life Technologies, cat. no. 18324-012) are mixed. The contents of tube ffl 
and #2 are mixed together and are then left sitting at room temperature for 30 
hours. Then, 3.2 ml of the LipofectAMINE™-DNA solution is added to the 
cells. The cells are incubated for six hours at 37°C, washed once with Hank's 
Balanced Salt Solution, and then refed with growth medium and incubated for 
an additional 24 hours at 37 °C 

D. Transfectant Irradiation 

Transfectants are irradiated to a dose of 25 By or 2500R. the 

cells are then counted by trypan blue exclusion. 2 X 10 7 irradiated transfectants 
are resuspended in a volume of 0.2-0.4 ml of Hanks Balanced Salt Solution. 

E. Vaccination 

Patients are vaccinated by subcutaneous inoculation of 2 X 10 7 

irradiated cells at 2-3 week intervals. A shorter or longer regimen is used. 
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depending upon the results of delayed type hypersensitivity (DTH) reaction 
monitoring (described below). 

F. Patient Assessment bv DTH Monitoring 

Patients are assessed for reactivity to the irradiated transfectants 

5 by a test of skin reactivity in a DTH reaction, as described by Chang et al. 
(1993), Cancer Research 53:1043-1050. To measure reactivity to the 
autologous irradiated transfectants, 10 4 - 10 6 transfected irradiated cells in a 
volume of 0.1 ml HBSS are inoculated intradermal^ . Induration is measured 
48 hours later, as an average of two perpendicular diameters. Responses of 

10 greater than 2 mm are considered positive. 

Example 3 
v-mvc Transfection of Murine Fibroblasts 

A. Vector Preparation 

The v-myc retroviral oncogene of avian myelocytomatosis virus 
15 MC29 (Land et al (1983), Nature 304:596-602) was obtained from the 
American Type Culture Collection, Rockville, MD, 20852, as the pSVv-myc 
vector (ATCC No. 45014). The v-myc-positive EcoW-Kpnl fragment of pSVv- 
myc was ligated into the polylinker sites of the pBK-CMV plasmid (Stratagene 
Cloning Systems, La Jolla, CA). 

20 B. Cell Transfection 

Stable transfection using the pBK-CMV-v-myc vector was carried 
out on a line of A31 fibroblasts (Balb/c origin), obtained from the ATCC. 2 
X 10 5 cells were seeded in a 100 mm/dish and allowed to grow for 18-20 h 
(RPMI 1640 medium and 10% fetal bovine serum), at which time the cells 

25 reached 50-70% confluence. The cells were then washed twice in Dulbecco's 
Modified Eagles Medium (without serum or antibiotics). A LipofectAMINE™- 
DNA solution was prepared according to Example 2.C. , with the pBK-CMV-v- 
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myc vector DNA, and 3.2 ml of the LipofectAMINE™-DNA solution added to 
the cells. The cells were then incubated for 6 hours at 37 °C, washed once with 
Hank's Balanced Salt Solution, and then refed with the growth medium and 
incubated for an additional 24 hour at 37°C. Thereafter, the cells were fed 
once every two days with growth medium containing 250 ^g/ml geneticin 
(G418; Gibco BRL cat. no. 11811) as the selective marker. Within two weeks, 
colonies were picked and expanded into permanent cell lines. The cells were 
then washed and collected by centrifugation. 

It should be noted that the procedure for transient transfection is 
the same, through the point of incubation with the Lipofectamine™-DNA 
solution. Thereafter, the cells are washed and incubated for 72 hours in growth 
medium. 

All references cited with respect to synthetic, preparative and analytical 
procedures are incorporated herein by reference. 

The present invention may be embodied in other specific forms without 
departing from the spirit or essential attributes thereof and, accordingly, 
reference should be made to the appended claims, rather than to the foregoing 
specification, as indication the scope of the invention. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Allegheny University of the Health Sciences 

Halpern, Michael S. 
England, James M. 

(ii) TITLE OF INVENTION: CANCER VACCINE 

(iii) NUMBER OF SEQUENCES: 14 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Seidel , Gonda, Lavorgna & Monaco, P.C. 

(B) STREET: Suite 1800, Two Penn Center Plaza 

(C) CITY: Philadelphia 

(D) STATE: PA 

(E) COUNTRY: USA 
<F) ZIP: 19102 

{v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/010,262 

(B) FILING DATE: 19-JAN-1996 

(viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: Monaco, Daniel A. 

(B) REGISTRATION NUMBER: 30,480 

(C) REFERENCE /DOCKET NUMBER : 7933-33 PC 

<ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (215) 568-8383 

(B) TELEFAX: (215) 568-5549 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
CCAGTTCCAG CCTGGAGAGA ACCTATA 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
GATCTATAGG TTCTCTCCAG GCTGGAACTG GGGCT 35 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1599 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

GAGACTGTGC CCTGTCCACG GTGCCTCCTG CATGTCCTGC TGCCCTGAGC TGTCCCGAGC 6 0 

TAGGTGACAG CGTACCACGC TGCCACCATG AATGAGGTGT CTGTCATCAA AGAAGGCTGG 12 0 

CTCCACAAGC GTGGTGAATA CATCAAGACC TGGAGGCCAC GGTACTTCCT GCTGAAGAGC 180 

GACGGCTCCT TCATTGGGTA CAAGGAGAGG CCCGAGGCCC CTGATCAGAC TCTACCCCCC 24 0 

TTAAACAACT TCTCCGTAGC AGAATGCCAG CTGATGAAGA CCGAGAGGCC GCGACCCAAC 300 

ACCTTTGTCA TACGCTGCCT GCAGTGGACC ACAGTCATCG AGAGGACCTT CCACGTGGAT 360 

TCTCCAGACG AGAGGGAGGA GTGGATGCGG GCCATCCAGA TGGTCGCCAA CAGCCTCAAG 42 0 

CAGCGGGCCC CAGGCGAGGA CCCCATGGAC TACAAGTGTG GCTCCCCCAG TGACTCCTCC 480 

ACGACTGAGG AGATGGAAGT GGCGGTCAGC AAGGCACGGG CTAAAGTGAC CATGAATGAC 54 0 

TTCGACTATC TCAAACTCCT TGGCAAGGGA ACCTTTGGCA AAGTCATCCT GGTGCGGGAG 600 

AAGGCCACTG GCCGCTACTA CGCCATGAAG ATCCTGCGAA AGGAAGTCAT CATTGCCAAG 66 0 

GATGAAGTCG CTCACACAGT CACCGAGAGC CGGGTCCTCC AGAACACCAG GCACCCGTTC 72 0 

CTCACTGCGC TGAAGTATGC CTTCCAGACC CACGACCGCC TGTGCTTTGT GATGGAGTAT 78 0 

GCCAACGGGG GTGAGCTGTT CTTCCACCTG TCCCGGGAGC GTGTCTTCAC AGAGGAGCGG 840 

GCCCGGTTTT ATGGTGCAGA GATTGTCTCG GCTCTTGAGT ACTTGCACTC GCGGGACGTG 900 

GTATACCGCG ACATCAAGCT GGAAAACCTC ATGC TGGACA AAGATGGCCA CATCAAGATC 96 0 
ACTGACTTTG GCCTCTGCAA AGAGGGCATC AGTGACGGGG CCACCATGAA AACCTTCTGT 
GGGACCCCGG AGTACCTGGC GCCTGAGGTG CTGGAGGACA ATGACTATGG CCGGGCCGTG 
GACTGGTGGG GGCTGGGTGT GGTCATGTAC GAGATGATGT GCGGCCGCCT GCCCTTCTAC 
AACCAGGACC ACGAGCGCCT CTTCGAGCTC ATCCTCATGG AAGAGATCCG CTTCCCGCGC 
ACGCTCAGCC CCGAGGCCAA GTCCCTGCTT GCTGGGCTGC TTAAGAAGGA CCCCAAGCAG 
AGGCTTGGTG GGGGGCCCAG CGATGCCAAG GAGGTCATGG AGCACAGGTT CTTCCTCAGC 
ATCAACTGGC AGGACGTGGT CCAGAAGAAG CTCCTGCCAC CCTTCAAACC TCAGGTCACG 
TCCGAGGTCG ACACAAGGTA CTTCGATGAT GAATTTACCG CCCAGTCCAT CACAATCACA 
CCCCCTGACC GCTATGACAG CCTGGGCTTA CTGGAGCTGG ACCAGCGGAC CCACTTCCCC 
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CAGTTCTCCT ACTCGGCCAG CATCCGCGAG TGAGCAGTCT GCCCACGCAG AGGACGCACG 
CTCGCTGCCA TCACCGCTGG GTGGTTTTTT ACCCCTGCC 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 530 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 

<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 



AATTCTCGAG 


CTCGTCGACC 


GGTCGACGAG 


CTCGAGGGTC 


GACGAGCTCG 


AGGGCGCGCG 


60 


CCCGGCCCCC 


ACCCCTCGCA 


GCACCCCGCG 


CCCCGCGCCC 


TCCCAGCCGG 


GTCCAGCCGG 


120 


AGCCATGGGG 


CCGGAGCCGC 


AGTGAGCACC 


ATGGAGCTGG 


CGGCCTTGTG 


CCGCTGGGGG 


180 


CTCCTCCTCG 


CCCTCTTGCC 


CCCCGGAGCC 


GCGAGCACCC 


AAGTGTGCAC 


CGGCACAGAC 


240 


ATGAAGCTGC 


GGCTCCCTGC 


CAGTCCCGAG 


ACCCACCTGG 


ACATGCTCCG 


CCACCTCTAC 


300 


CAGGGCTGCC 


AGGTGGTGCA 


GGGAAACCTG 


GAACTCACCT 


ACCTGCCCAC 


CAATGCCAGC 


360 


CTGTCCTTCC 


TGCAGGATAT 


CCAGGAGGTG 


CAGGGCTACG 


TGCTCATCGC 


TCACAACCAA 


420 


GTGAGGCAGG 


TCCCACTGCA 


GAGGCTGCGG 


ATTGTGCGAG 


GCACCCAGCT 


CTTTGAGGAC 


480 


AACTATGCCC 


TGGCCGTGCT 


AGACAATGGA 


GACCCGCTGA 


ACAATACCAC 


CCCTGTCACA 


540 


GGGGCCTCCC 


CAGGAGGCCT 


GCGGGAGCTG 


CAGCTTCGAA 


GCCTCACAGA 


GATCTTGAAA 


600 


GGAGGGGTCT 


TGATCCAGCG 


GAACCCCCAG 


CTCTGCTACC 


AGGACACGAT 


TTTGTGGAAG 


660 


GACATCTTCC 


ACAAGAACAA 


CCAGCTGGCT 


CTCACACTGA 


TAGACACCAA 


CCGCTCTCGG 


720 


GCCTGCCACC 


CCTGTTCTCC 


GATGTGTAAG 


GGCTCCCGCT 


GCTGGGGAGA 


GAGTTCTGAG 


780 


GATTGTCAGA 


GCCTGACGCG 


CACTGTCTGT 


GCCGGTGGCT 


GTGCCCGCTG 


CAAGGGGCCA 


840 


CTGCCCACTG 


ACTGCTGCCA 


TGAGCAGTGT 


GCTGCCGGCT 


GCACGGGCCC 


CAAGCACTCT 


900 


GACTGCCTGG 


CCTGCCTCCA 


CTTCAACCAC 


AGTGGCATCT 


GTGAGCTGCA 


CTGCCCAGCC 


960 


CTGGTCACCT 


ACAACACAGA 


CACGTTTGAG 


TCCATGCCCA 


ATCCCGAGGG 


CCGGTATACA 


1020 


TTCGGCGCCA 


GCTGTGTGAC 


TGCCTGTCCC 


TACAACTACC 


TTTCTACGGA 


CGTGGGATCC 


1080 


TGCACCCTCG 


TCTGCCCCCT 


GCACAACCAA 


GAGGTGACAG 


CAGAGGATGG 


AACACAGCGG 


1140 


TGTGAGAAGT 


GCAGCAAGCC 


CTGTGCCCGA 


GTGTGCTATG 


GTCTGGGCAT 


GGAGCACTTG 


1200 


CGAGAGGTGA 


GGGCAGTTAC 


CAGTGCCAAT 


ATCCAGGAGT 


TTGCTGGCTG 


CAAGAAGATC 


1260 


TTTGGGAGCC 


TGGCATTTCT 


GCCGGAGAGC 


TTTGATGGGG 


ACCCAGCCTC 


CAACACTGCC 


1320 


CCGCTCCAGC 


CAGAGCAGCT 


CCAAGTGTTT 


GAGACTCTGG 


AAGAGATCAC 


AGGTTACCTA 


1380 


TACATCTCAG 


CATGGCCGGA 


CAGCCTGCCT 


GACCTCAGCG 


TCTTCCAGAA 


CCTGCAAGTA 


1440 


ATCCGGGGAC 


GAATTCTGCA 


CAATGGCGCC 


TACTCGCTGA 


CCCTGCAAGG 


GCTGGGCATC 


1500 
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AGCTGGCTGG GGCTGCGCTC ACTGAGGGAA CTGGGCAGTG GACTGGCCCT CATCCACCAT 1560 
AACACCCACC TCTGCTTCGT GCACACGGTG CCCTGGGACC AGCTCTTTCG GAACCCGCAC 1620 
CAAGCTCTGC TCCACACTGC CAACCGGCCA GAGGACGAGT GTGTGGGCGA GGGCCTGGCC 1680 
TGCCACCAGC TGTGCGCCCG AGGGCACTGC TGGGGTCCAG GGCCCACCCA GTGTGTCAAC 1740 
TGCAGCCAGT TCCTTCGGGG CCAGGAGTGC GTGGAGGAAT GCCGAGTACT GCAGGGGCTC 1800 
CCCAGGGAGT ATGTGAATGC CAGGCACTGT TTGCCGTGCC ACCCTGAGTG TCAGCCCCAG 186 0 
AATGGCTCAG TGACCTGTTT TGGACCGGAG GCTGACCAGT GTGTGGCCTG TGCCCACTAT 192 0 
AAGGACCCTC CCTTCTGCGT GGCCCGCTGC CCCAGCGGTG TGAAACCTGA CCTCTCCTAC 1980 
ATGCCCATCT GGAAGTTTCC AGATGAGGAG GGCGCATGCC AGCCTTGCCC CATCAACTGC 2 04 0 
ACCCACTCCT GTGTGGACCT GGATGACAAG GGCTGCCCCG CCGAGCAGAG AGCCAGCCCT 2100 
CTGACGTCCA TCGTCTCTGC GGTGGTTGGC ATTCTGCTGG TCGTGGTCTT GGGGGTGGTC 2160 
TTTGGGATCC TCATCAAGCG ACGGCAGCAG AAGATCCGGA AGTACACGAT GCGGAGACTG 222 0 

CTGCAGGAAA CGGAGCTGGT GGAGCCGCTG ACACCTAGCG GAGCGATGCC CAACCAGGCG 2280 

CAGATGCGGA TCCTGAAAGA GACGGAGCTG AGGAAGGTGA AGGTGCTTGG ATCTGGCGCT 234 0 

TTTGGCACAG TCTACAAGGG CATCTGGATC CCTGATGGGG AGAATGTGAA AATTCCAGTG 2400 

GCCATCAAAG TGTTGAGGGA AAACACATCC CCCAAAGCCA ACAAAGAAAT CTTAGACGAA 2460 

GCATACGTGA TGGCTGGTGT GGGCTCCCCA TATGTCTCCC GCCTTCTGGG CATCTGCCTG 2 520 

ACATCCACGG TGCAGCTGGT GACACAGCTT ATGCCCTATG GCTGCCTCTT AGACCATGTC 2 580 

CGGGAAAACC GCGGACGCCT GGGCTCCCAG GACCTGCTGA ACTGGTGTAT GCAGATTGCC 2640 

AAGGGGATGA GCTACCTGGA GGATGTGCGG CTCGTACACA GGGACTTGGC CGCTCGGAAC 2700 

GTGCTGGTCA AGAGTCCCAA CCATGTCAAA ATTACAGACT TCGGGCTGGC TCGGCTGCTG 2 760 

GACATTGACG AG AC AGAG TA CCATGCAGAT GGGGGCAAGG TGCCCATCAA GTGGATGGCG 2 82 0 

CTGGAGTCCA TTCTCCGCCG GCGGTTCACC CACCAGAGTG ATGTGTGGAG TTATGGTGTG 28 80 

ACTGTGTGGG AGCTGATGAC TTTTGGGGCC AAACCTTACG ATGGGATCCC AGCCCGGGAG 2 940 

ATCCCTGACC TGCTGGAAAA GGGGGAGCGG CTGCCCCAGC CCCCCATCTG CACCATTGAT 3 0 00 

GTCTACATGA TCATGGTCAA ATGTTGGATG ATTGACTCTG AATGTCGGCC AAG AT TCCGG 3 06 0 

GAGTTGGTGT CTGAATTCTC CCGCATGGCC AGGGACCCCC AGCGCTTTGT GGTCATCCAG 3120 

AATGAGGACT TGGGCCCAGC CAGTCCCTTG GACAGCACCT TCTACCGCTC ACTGCTGGAG 3180 

GACGATGACA TGGGGGACCT GGTGGATGCT GAGGAGTATC TGGTACCCCA GCAGGGCTTC 324 0 

TTCTGTCCAG ACCCTGCCCC GGGCG CTGGG GGCATGGTCC ACCACAGGCA CCGCAGCTCA 33 00 

TCTACCAGGA GTGGCGGTGG GGACCTGACA CTAGGGCTGG AGCCCTCTGA AGAGGAGGCC 3 360 

CCCAGGTCTC CACTGGCACC CTCCGAAGGG GCTGGCTCCG ATGTATTTGA TGGTGACCTG 34 20 
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GGAATGGGGG CAGCCAAGGG GCTGCAAAGC CTCCCCACAC ATGACCCCAG CCCTCTACAG 348 0 

CGGTACAGTG AGGACCCCAC AGTACCCCTG CCCTCTGAGA CTGATGGCTA CGTTGCCCCC 3 54 0 

CTGACCTGCA GCCCCCAGCC TGAATATGTG AACCAGCCAG ATGTTCGGCC CCAGCCCCCT 3600 

TCGCCCCGAG AGGGCCCTCT GCCTGCTGCC CGACCTGCTG GTGCCACTCT GGAAAGGGCC 3660 

AAGACTCTCT CCCCAGGGAA GAATGGGGTC GTCAAAGACG TTTTTGCCTT TGGGGGTGCC 3 72 0 

GTGGAGAACC CCGAGTACTT GACACCCCAG GGAGGAGCTG CCCCTCAGCC CCACCCTCCT 37 8 0 

CCTGCCTTCA GCCCAGCCTT CGACAACCTC TATTACTGGG ACCAGGACCC ACCAGAGCGG 3840 

GGGGCTCCAC CCAGCACCTT CAAAGGGACA CCTACGGCAG AGAACCCAGA GTACCTGGGT 3 900 

CTGGACGTGC CAGTGTGAAC CAGAAGGCCA AGTCCGCAGA AGCCCTGATG TGTCCTCAGG 3 96 0 

GAGCAGGGAA GGCCTGACTT CTGCTGGCAT CAAGAGGTGG GAGGGCCCTC CGACCACTTC 402 0 

CAGGGGAACC TGCCATGCCA GGAACCTGTC CTAAGGAACC TTCCTTCCTG CTTGAGTTCC 4080 

CAGATGGCTG GAAGGGGTCC AGCCTCGTTG GAAGAGGAAC AGCACTGGGG AGTCTTTGTG 414 0 

GATTCTGAGG CCCTGCCCAA TGAGACTCTA GGGTCCAGTG GATGCCACAG CCCAGCTTGG 4200 

CCCTTTCCTT CCAGATCCTG GGTACTGAAA GCCTTAGGGA AGCTGGCCTG AGAGGGGAAG 4260 

CGGCCCTAAG GGAGTGTCTA AGAACAAAAG CGACCCATTC AGAGACTGTC CCTGAAACCT 4 320 

AGTACTGCCC CCCATGAGGA AGGAACAGCA ATGGTGTCAG TATCCAGGCT TTGTACAGAG 438 0 

TGCTTTTCTG TTTAGTTTTT ACTTTTTTTG TTTTGTTTTT TTAAAGACGA AATAAAG AC C 444 0 

CAGGGGAGAA TGGGTGTTGT ATGGGGAGGC AAGTGTGGGG GGTCCTTCTC CACACCCACT 4 500 

TTGTCCATTT GCAAATATAT TTTGGAAAAC 4 530 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

ATGTGCAATA CCAACATGTC TGTACCTACT GATGGTGCTG TAACCACCTC ACAGATTCCA 6 0 

GCTTCGGAAC AAGAGACCCT GGATCTTGAT GCTGGTGTAA GTGAACATTC AGGTGATTGG 12 0 

TTGGATCAGG ATTCAGTTTC AGATCAGTTT AGTGTAGAAT TTGAAGTTGA ATCTCTCGAC 18 0 

TCAGAAGATT ATAGCCTTAG TGAAGAAGGA CAAGAACTCT CAGATGAAGA TGATGAGGTA 24 0 

TATCAAGTTA CTGTGTATCA GGCAGGGGAG AGTGATACAG ATTCATTTGA AGAAGATCCT 300 

GAAATTTCCT TAGCTGACTA TTGGAAATGC ACTTCATGCA ATGAAATGAA TCCCCCCCTT 360 

CCATCACATT GCAACAGATG TTGGGCCCTT CGTGAGAATT GGCTTCCTGA AGATAAAGGG 420 

AAAGATAAAG GGGAAATCTC TGAGAAAGCC AAACTGGAAA ACTCAACACA AGCTGAAGAG 480 
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GGCTTTGATG TTCCTGATTG TAAAAAAACT ATAGTGAATG ATTCCAGAGA GTCATGTGTT 
GAGGAAAATG ATGATAAAAT TACACAAGCT TCACAATCAC AAGAAAGTGA AGACTATTCT 
CAGCCATCAA CTTCTAGTAG CATTATTTAT AGCAGCCAAG AAGATGTGAA AGAGTTTGAA 
AGGGAAGAAA CCCAAGACAA AGAAGAGAGT GTGGAATCTA GTTTGCCCCT TAATGCCATT 
GAACCTTGTG TGATTTGTCA AGGTCGACCT AAAAATGGTT GCATTGTCCA TGGCAAAACA 
GGACATCTTA TGGCCTGCTT TACATGTGCA AAGAAGCTAA AGAAAAGGAA TAAGCCCTGC 
CCAGTATGTA GACAACCAAT TCAAATGATT GTGCTAACTT ATTTCCCCTA G 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 657 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 6: 
ATGTGCAATA CCAACATGTC TGTACCTACT GATGGTGCTG TAACCACCTC ACAGATTCCA 
GCTTCGGAAC AAGAGACCCT GGACTATTGG AAATGCACTT CATGCAATGA AATGAATCCC 
CCCCTTCCAT CACATTGCAA CAGATGTTGG GCCCTTCGTG AGAATTGGCT TCCTGAAGAT 
AAAGGGAAAG ATAAAGGGGA AATCTCTGAG AAAGCCAAAC TGGAAAACTC AACACAAGCT 
GAAGAGGGCT TTGATGTTCC TGATTGTAAA AAAACTATAG TGAATGATTC CAGAGAGTCA 
TGTGTTGAGG AAAATGATGA TAAAATTACA CAAGCTTCAC AATCACAAGA AAGTGAAGAC 
TATTCTCAGC CATCAACTTC TAGTAGCATT ATTTATAGCA GCCAAGAAGA TGTGAAAGAG 
TTTGAAAGGG AAGAAACCCA AGACAAAGAA GAGAG TGTGG AATCTAGTTT GCCCCTTAAT 
GCCATTGAAC CTTGTGTGAT TTGTCAAGGT CGACCTAAAA ATGGTTGCAT TGTCCATGGC 
AAAACAGGAC ATCTTATGGC CTGCTTTACA TGTGCAAAGA AGCTAAAGAA AAGGAATAAG 
CCCTGCCCAG TATGTAGACA ACCAATTCAA ATGATTGTGC TAACTTATTT CCCCTAG 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 966 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
ATGTGCAATA CCAACATGTC TGTACCTACT GATGGTGCTG TAACCACCTC ACAGATTCCA 
GCTTCGGAAC AAGAGACCCT GGTTAGACCA AAGCCATTGC TTTTGAAGTT ATTAAAGTCT 
GTTGGTGCAC AAAAAGACAC TTATACTATG AAAGAGGATC TTGATGCTGG TGTAAGTGAA 
CATTCAGGTG ATTGGTTGGA TCAGGATTCA GTTTCAGATC AGTTTAGTGT AGAATTTGAA 
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840 

891 
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GTTGAATCTC 


TCGACTCAGA 


AGATTATAGC 


CTTAGTGAAG 


AAGGACAAGA 


ACTCTCAGAT 


300 


GAAGATGATG 


AGGTATATCA 


AGTTACTGTG 


TATCAGGCAG 


GGGAGAGTGA 


TACAGATTCA 


360 


TTTGAAGAAG 


ATCCTGAAAT 


TTCCTTAGCT 


GACTATTGGA 


AATGCACTTC 


ATGCAATGAA 


420 


ATGAATCCCC 


CCCTTCCATC 


ACATTGCAAC 


AGATGTTGGG 


CCCTTCGTGA 


GAATTGGCTT 


480 


CCTGAAGATA 


AAGGGAAAGA 


TAAAGGGGAA 


ATCTCTGAGA 


AAGCCAAACT 


GGAAAACTCA 


540 


ACACAAGCTG 


AAGAGGGCTT 


TGATGTTCCT 


GATTGTAAAA 


AAACTATAGT 


GAATGATTCC 


600 


AGAGAGTCAT 


GTGTTGAGGA 


AAATGATGAT 


AAAATTACAC 


AAGCTTCACA 


ATCACAAGAA 


660 


AGTGAAGACT 


ATTCTCAGCC 


ATCAACTTCT 


AGTAGCATTA 


TTTATAGCAG 


CCAAGAAGAT 


720 


GTGAAAGAGT 


TTGAAAGGGA 


AGAAACCCAA 


GACAAAGAAG 


AGAGTGTGGA 


ATCTAGTTTG 


780 


CCCCTTAATG 


CCATTGAACC 


TTGTGTGATT 


TGTCAAGGTC 


GACCTAAAAA 


TGGTTGCATT 


840 


GTCCATGGCA 


AAACAGGACA 


TCTTATGGCC 


TGCTTTACAT 


GTGCAAAGAA 


GCTAAAGAAA 


900 


AGGAATAAGC 


CCTGCCCAGT 


ATGTAGACAA 


CCAATTCAAA 


TGATTGTGCT 


AACTTATTTC 


960 


CCCTAG 












966 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 399 base pairs 

(B) TYPE: nucleic acid 
<C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

ATGTGCAATA CCAACATGTC TGTACCTACT GATGGTGCTG TAACCACCTC ACAGATTCCA 60 

GCTTCGGAAC AAGAGACCCT GGTTAGACAA GAAAGTGAAG ACTATTCTCA GCCATCAACT 12 0 

TCTAGTAGCA TTATTTATAG CAG CCAAGAA GATGTGAAAG AGTTTGAAAG GGAAGAAACC 18 0 

CAAGACAAAG AAGAGAGTGT GGAATCTAGT TTGC CCCTTA ATGCCATTGA ACCTTGTGTG 24 0 

ATTTGTCAAG GTCGACCTAA AAATGGTTGC ATTGTCCATG GCAAAACAGG ACATCTTATG 300 

GCCTGCTTTA CATGTGCAAA GAAGCTAAAG AAAAGGAATA AGCCCTGCCC AGTATGTAGA 360 

CAACCAATTC AAATGATTGT GCTAACTTAT TTCCCCTAG 3 99 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 309 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
ATGTGCAATA CCAACATGTC TGTACCTACT GATGGTGCTG TAACCACCTC ACAGATTCCA 60 
GCTTCGGAAC AAGAGACCCT GGTTAGACCA AAGCCATTGC TTTTGAAGTT ATTAAAGTCT 12 0 
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TATCAATGTT CCTCAGCCAG CTGCTGCAGC TATTCAGAGA CACTATACTG ATGAAGACCC 
TGAGAAAGAA AAACGAATAA AGGAATTAGA GTTGCTACTT ATGTCGACTG AGAATGAACT 
GAAAGGGCAG CAGGCATTAC CAACACAGAA CCACACAGCA AACTACCCCG GCTGGCACAG 
CACCACGGTT GCTGACAATA CCAGGACCAG TGGTGACAAT GCGCCTGTTT CCTGTTTGGG 
GGAACATCAC CACTGTACTC CATCTCCACC AGTGGATCAT GGTTGCTTAC CTGAGGAAAG 
TGCGTCCCCC GCACGGTGCA TGATTGTTCA CCAGAGCAAC ATCCTGGATA ATGTTAAGAA 
TCTCTTAGAA TTTGCAGAAA CACTCCAGTT AATAGACTCC TTCTTAAACA CATCGTCCAA 
TCACGAGAAT CTGAACCTGG ACAACCCTGC ACTAACCTCC ACGCCAGTGT GTGGCCACAA 
GATGTCTGTT ACCACCCCAT TCCACAAGGA CCAGACTTTC ACTGAATACA GGAAGATGCA 
CGGCGGAGCA GTCTAGAGCT CAATTATAAT AATCTTGCGA ATCGGGCTGT AACGGGGCAA 



309 



GTTGGTGCAC AAAAAGACAC TTATACTATG AAAGAGGTTC TTTTTTATCT TGGCCAGTAT 180 
ATTATGACTA AACGATTATA TGATGAGAAG CAACAACATA TTGTAAATGA TTGTGCTAAC 240 
TTATTTCCCC TAGTTGACCT GTCTATAAGA GAATTATATA TTTCTAACTA TATAACCCTA 300 
GGAATTTAG 

(2} INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1897 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



CACAGATAAG 


GTTATTTGGG 


TACCCTCTCG 


AAAAGTTAAA 


CCGGACATCG 


CCCAAAAGGA 


60 


TGAGGTGACT 


AAGAAAGATG 


AGGCGAGCCC 


TCTTTTTGCA 


GGCTGGAGGC 


ACATAGATAA 


120 


GAGAATTATC 


ACTCTACATT 


CATCTTTCTC 


AAAGATTAAT 


CTACTTGTGT 


GTTTTATATT 


180 


TCATTAGAAT 


CGGACAGATG 


TTCAGTGCCA 


GCACCGGTGG 


CAGAAAGTAT 


TAAACCCAGA 


240 


ACTTAACAAA 


GGTCCATGGA 


CTAAAGAGGA 


GGATCAAAGG 


GTAATAGAAC 


ACGTG C AG AA 


300 


ATACGGTCCA 


AAGCGCTGGT 


CGGACATTGC 


TAAGCATTTG 


AAGGGAAGGA 


TTGGAAAACA 


360 


GTGCAGGGAG 


AGGTGGCACA ACCATCTGAA 


TCCAGAAGTG 


AAGAAAACCT 


CCTGGACAGA 


420 


AGAGGAAGAT 


AGAATTATTT 


ACCAGGCACA 


CAAGAGACTG 


GGAAACAGAT 


GGGCAGAAAT 


480 


TGCAAAGTTG 


CTGCCTGGAC 


GGACTGATAA 


CGCTGTCAAG 


AACCACTGGA 


ATTCCACCAT 


540 


GCGCCGGAAG 


GTCGAGCAGG 


AGGGTTACCC 


GCAGGAGTCC 


TCCAAAGCCG 


GCCCGCCCTC 


600 


GGCAACCACC 


GGCTTCCAGA 


AGAGCAGCCA 


TCTGATGGCC 


TTTGCCCACA 


ACCCACCTGC 


660 


AGGCCCGCTC 


CCGGGGGCCG 


GCCAGGCCCC 


TCTGGGCAGT 


GACTACCCCT 


ACTACCACAT 


720 


TGCTGAGCCA 


CAAAATGTCC 


CTGGTCAGAT 


CCCATATCCA 


GTAGCACTGC 


ATATAAATAT 


780 



840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
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GGCTTGACCG AGGGGACTAT AACATGTATA GGCGAAAAGC GGGGTCTCGG TTGTAACGCG 144 0 

CTTAGGAAGT CCCCTCGAGG TATGGCAGAT ATGCTTTTGC ATAGGGAGGG GGAAATGTAG 1500 

TCTTAATCGT AGGTTAACAT GTATATTACC AAATAAGGGA ATCGCCTGAT GCACCAAATA 156 0 

AGGTATTATA TGATCCCATT GGTGGTGAAG GAGCGACCTG AGGGCATATG GGCGTTAACA 162 0 

GAACTGTCTG TCCTTGCGTC ATTCCTCATC GGATCATGTA CGCGGCAGAG TATGATTGGA 1680 

TAACAGGATG GCACCATTCA TCGTGGCGCA TGCTGATTGG TGCGACTAAG GAGTTGTGTA 174 0 

ACCCACGAAT GTACTTAAGC TTGTAGTTGC TAACAATAAA GTGCCATTCT ACCTCTCACC 1800 

ACATTGGTGT GCACCTGGGT TGATGGCCGG ACCGTCGATT CCCTGACGAC TGCGAACACC I860 

TGAATGAAGC TGAAGGCTTC AGGTACCCTT ACTTGAT 1897 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8082 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

AGCTTGTTTG GCCGTTTTAG GGTTTGTTGG AATTTTTTTT TCGTCTATGT ACTTGTGAAT 6 0 

TATTTCACGT TTGCCATTAC CGGTTCTCCA TAGGGTGATG TTCATTAGCA GTGGTGATAG 12 0 

GTTAATTTTC ACCATCTCTT ATGCGGTTGA ATAGTCACCT CTGAACCACT TTTTCCTCCA 180 

GTAACTCCTC TTTCTTCGGA CCTTCTGCAG CCAACCTGAA AGAATAACAA GGAGGTGGCT 24 0 

GGAAACTTGT TTTAAGGAAC CGCCTGTCCT TCCCCCGCTG GAAACCTTGC ACCTCGGACG 300 

CTCCTGCTCC TGCCCCCACC TGACCCCCGC CCTCGTTGAC ATCCAGGCGC GATGATCTCT 360 

GCTGCCAGTA GAGGGCACAC TTACTTTACT TTCGCAAACC TGAACGCGGG TGCTGCCCAG 420 

AGAGGGGGCG GAGGGAAAGA CGCTTTGCAG CAAAATCCAG CATAGCGATT GGTTGCTCCC 480 

CGCGTTTGCG GCAAAGGCCT GGAGGCAGGA GTAATTTGCA ATCCTTAAAG CTGAATTGTG 54 0 

CAGTGCATCG GATTTGGAAG CTACTATATT CACTTAACAC TTGAACGCTG AGCTGCAAAC 600 

TCAACGGGTA AT AACC CAT C TTGAACAGCG TACATGCTAT ACACACACCC CTTTCCCCCG 660 

AATTGTTTTC TCTTTTGGAG GTGGTGGAGG GAGAGAAAAG TTTACTTAAA ATGCCTTTGG 720 

GTGAGGGACC AAGGATGAGA AGAATGTTTT TTGTTTTTCA TGCCGTGGAA TAACACAAAA 780 

TAAAAAATCC CGAGGGAATA TACATTATAT ATTAAATATA GATCATTTCA GGGAGCAAAC 84 0 

AAATCATGTG TGGGGCTGGG CAACTAGCTG AGTCGAAGCG TAAATAAAAT GTGAATACAC 900 

GTTTGCGGGT TACATACAGT GCACTTTCAC TAGTATTCAG AAAAAATTGT GAGTCAGTGA 960 

ACTAGGAAAT TAATGCCTGG AAGGCAGCCA AATTTTAATT AGCTCAAGAC TCCCCCCCCC 1020 

CCCCAAAAAA AGGCACGGAA GTAATACTCC TCTCCTCTTC TTTGATCAGA ATCGATGCAT 10 80 
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TTTTTGTGCA TGACCGCATT TCCAATAATA AAAGGGGAAA GAGGACCTGG AAAGGAATTA 114 0 

AACGTCCGGT TTGTCCGGGG AGGAAAGAGT TAACGGTTTT TTTCACAAGG GTCTCTGCTG 1200 

ACTCCCCCGG CTCGGTCCAC AAGCTCTCCA CTTGCCCCTT TTAGGAAGTC CGGTCCCGCG 1260 

GTTCGGGTAC CCCCTGCCCC TCCCATATTC TCCCGTCTAG CACCTTTGAT TTCTCCCAAA 1320 

CCCGGCAGCC CGAGACTGTT GCAAACCGGC GCCACAGGGC GCAAAGGGGA TTTGTCTCTT 138 0 

CTGAAACCTG GCTGAGAAAT TGGGAACTCC GTGTGGGAGG CGTGGGGGTG GGACGGTGGG 144 0 

GTACAGACTG GCAGAGAGCA GGCAACCTCC CTCTCGCCCT AGCCCAGCTC TGGAACAGGC 1500 

AGACACATCT CAGGGCTAAA CAGACGCCTC CCGCACGGGG CCCCACGGAA GCCTGAGCAG 1560 

GCGGGGCAGG AGGGGCGGTA TCTGCTGCTT TGGCAGCAAA TTGGGGGACT CAGTCTGGGT 1620 

GGAAGGTATC CAATCCAGAT AGCTGTGCAT ACATAATGCA TAATACATGA CTCCCCCCAA 1680 

CAAATGCAAT GGGAGTTTAT TCATAACGCG CTCTCCAAGT ATACGTGGCA ATGCGTTGCT 174 0 

GGGTTATTTT AATCATTCTA GGCATCGTTT TCCTCCTTAT GCCTCTATCA TTCCTCCCTA 1800 

TCTACACTAA CATCCCACGC TCTGAACGCG CGCCCATTAA TACCCTTCTT TCCTCCACTC i860 

TCCCTGGGAC TCTTGATCAA AGCGCGGCCC TTTCCCCAGC CTTAGCGAGG CGCCCTGCAG 1920 

CCTGGTACGC GCGTGGCGTG GCGGTGGGCG CGCAGTGCGT TCTCTGTGTG GAGGGCAGCT 1980 

GTTCCGCCTG CGATGATTTA TACTCACAGG ACAAGGATGC GGTTTGTCAA ACAGTACTGC 204 0 

TACGGAGGAG CAGCAGAGAA AGGGAGAGGG TTTGAGAGGG AGCAAAAGAA AATGGTAGGC 2100 

GCGCGTAGTT AATTCATGCG GCTCTCTTAC TCTGTTTACA TCCTAGAGCT AGAGTGCTCG 2160 

GCTGCCCGGC TGAGTCTCCT CCCCACCTTC CCCACCCTCC CCACCCTCCC CATAAGCGCC 2220 

CCTCCCGGGT TCCCAAAGCA GAGGGCGTGG GGGAAAAGAA AAAAGATCCT CTCTCGCTAA 2280 

TCTCCGCCCA CCGGCCCTTT ATAATGCGAG GGTCTGGACG GCTGAGGACC CCCGAGCTGT 234 0 

GCTGCTCGCG GCCGCCACCG CCGGGCCCCG GCCGTCCCTG GCTCCCCTCC TGCCTCGAGA 24 00 

AGGGCAGGGC TTCTCAGAGG CTTGGCGGGA AAAAGAACGG AGGGAGGGAT CGCGCTGAGT 2460 

ATAAAAGCCG GTTTTCGGGG CTTTATCTAA CTCGCTGTAG TAATTCCAGC GAGAGGCAGA 2520 

GGGAGCGAGC GGGCGGCCGG CTAGGGTGGA AGAGCCGGGC GAGCAGAGCT GCGCTGCGGG 2580 

CGTCCTGGGA AGGGAGATCC GGAGCGAATA GGGGGCTTCG CCTCTGGCCC AGCCCTCCCG 264 0 

CTGATCCCCC AGCCAGCGGT CCGCAACCCT TGCCGCATCC ACGAAACTTT GCCCATAGCA 2700 

GCGGGCGGGC ACTTTGCACT GGAACTTACA ACACCCGAGC AAGGACGCGA CTCTCCCGAC 2 760 

GCGGGGAGGC TATTCTGCCC ATTTGGGGAC ACTTCCCCGC CGCTGCCAGG ACCCGCTTCT 2820 

CTGAAAGGCT CTCCTTGCAG CTGCTTAGAC GCTGGATTTT TTTCGGGTAG TGGAAAACCA 2880 

GGTAAGCACC GAAGTCCACT TGCCTTTTAA TTTATTTTTT TATCACTTTA ATGCTGAGAT 294 0 

GAGTCGAATG CCTAAATAGG GTGTCTTTTC TCCCATTCCT GCGCTATTGA CACTTTTCTC 3000 
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AGAGTAGTTA 


TGGTAACTGG 


GGCTGGGGTG 


GGGGGTAATC 


CAGAACTGGA 


TCGGGGTAAA 


3060 


GTGACTTGTC 


AAGATGGGAG 


AGGAGAAGGC 


AGAGGGAAAA 


CGGGAATGGT 


TTTTAAGACT 


3120 


ACCCTTTCGA 


GATTTCTGCC 


TTATGAATAT 


ATTCACGCTG 


ACTCCCGGCC 


GGTCGGACAT 


3180 


TCCTGCTTTA 


TTGTGTTAAT 


TGCTCTCTGG 


GTTTTGGGGG 


GCTGGGGGTT 


GCTTTGCGGT 


3240 


GGGCAGAAAG 


CCCCTTGCAT 


CCTGAGCTCC 


TTGGAGTAGG 


GACCGCATAT 


CGCCTGTGTG 


3300 


AGCCAGATCG 


CTCCGCAGCC 


GCTGACTTGT 


CCCCGTCTCC 


GGGAGGGCAT 


TTAAATTTCG 


3360 


GCTCACCGCA 


TTTCTGACAG 


CCGGAGACGG 


ACACTGCGGC 


GCGTCCCGCC 


CGCCTGTCCC 


3420 


CGCGGCGATT 


CCAACCCGCC 


CTGATCCTTT 


TAAGAAGTTG 


GCATTTGGCT 


TTTTAAAAAG 


3480 


CAATAATACA 


ATTTAAAACC 


TGGGTCTCTA 


GAGGTGTTAG 


GACGTGGTGT 


TGGGTAGGCG 


3540 


CAGGCAGGGG 


AAAAGGGAGG 


CGAGGATGTG 


TCCGATTCTC 


CTGGAATCGT 


TGACTTGGAA 


3600 


AAACCAGGGC 


GAATCTCCGC 


ACCCAGCCCT 


GACTCCCCTG 


CCGCGGCCGC 


CCTCGGGTGT 


3660 


CCTCGCGCCC 


GAGATGCGGA 


GGAACTGCGA 


GGAGCGGGGC 


TCTGGGCGGT 


TC C AG AAC AG 


3720 


CTGCTACCCT 


TGGTGGGGTG 


GCTCCGGGGG 


AGGTATCGCA 


GCGGGGTCTC 


TGGCGCAGTT 


3780 


GCATCTCCGT 


ATTGAGTGCG 


AAGGGAGGTG 


CCCCTATTAT 


TATTTGACAC 


CCCCCTTGTA 


3840 


TTTATGGAGG 


GGTGTTAAAG 


CCCGCGGCTG 


AGCTCGCCAC 


TCCAGCCGGC 


GAGAGAAA*- i 


3900 


AGAAAAGCTG 


GCAAAAGGAG 


TGTTGGACGG 


GGGCGGTACT 


GGGGGTGGGG 


ACGGGGGCGG 


3960 


TGGAGAGGGA 


AGGTTGGGAG 


GGGCTGCGGT 


GCCGGCGGGG 


GTAGGAGAGC 


GGCTAGGGCG 


4020 


CGAGTGGGAA 


CAGCCGCAGC 


GGAGGGGCCC 


CGGCGCGGAG 


CGGGGTTCAC 


GCAGCCGCTA 


4080 


GCGCCCAGGC 


GCCTCTCGCC 


TTCTCCTTCA GGTGGCGCAA AACTTtGTGC 


CTTGGATTTT 


4140 


GGCAAATTGT 


TTTCCTCACC 


GCCACCTCCC 


GCGGCTTCTT 


AAGGGCGCCA 


GGGCCGATTT 


4200 


CGATTCCTCT 


GCCGCTGCGG 


GGCCGACTCC 


CGGGCTTTGC 


GCTCCGGGCT 


CCCGGGGGAG 


4260 


CGGGGGCTCG 


GCGGGCACCA AGCCGCTGGT 


TCACTAAGTG 


CGTCTCCGAG 


ATAGCAGGGG 


4320 


ACTGTCCAAA 


GGGGGTGAAA 


GGGTGCTCCC 


TTTATTCCCC 


CACCAAGACC 


ACCCAGCCGC 


4380 


TTTAGGGGAT 


AGCTCTGCAA 


GGGGAGAGGT 


TCGGGACTGT 


GGCGCGCACT 


GCGCGCTGCG 


4440 


CCAGGTTTCC 


GCACCAAGAC 


CCCTTTAACT 


CAAGACTGCC 


TCCCGCTTTG 


TGTGCCCCGC 


4500 


TCCAGCAGCC 


TCCCGCGACG 


ATGCCCCTCA 


ACGTTAGCTT 


CACCAACAGG 


AACTATGACC 


4560 


TCGACTACGA 


CTCGGTGCAG 


CCGTATTTCT 


ACTGCGACGA 


GGAGGAGAAC 


TTCTACCAGC 


4620 


AGCAGCAGCA 


GAGCGAGCTG 


CAGCCCCCGG 


CGCCCAGCGA 


GGATATCTGG 


AAGAAATTCG 


4680 


AGCTGCTGCC 


CACCCCGCCC 


CTGTCCCCTA 


GCCGCCGCTC 


CGGGCTCTGC 


TCGCCCTCCT 


4740 


ACGTTGCGGT 


CACACCCTTC 


TCCCTTCGGG 


GAGACAACGA 


CGGCGGTGGC 


GGGAGCTTCT 


4800 


CCACGGCCGA 


CCAGCTGGAG 


ATGGTGACCG 


AGCTGCTGGG 


AGGAGACATG 


GTGAACCAGA 


4860 


GTTTCATCTG 


CGACCCGGAC 


GACGAGACCT 


TCATCAAAAA 


CATCATCATC 


CAGGACTGTA 


4920 
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TGTGGAGCGG CTTCTCGGCC GCCGCCAAGC TCGTCTCAGA GAAGCTGGCC TCCTACCAGG 4980 

CTGCGCGCAA AGACAGCGGC AGCCCGAACC CCGCCCGCGG CCACAGCGTC TGCTCCACCT 504 0 

CCAGCTTGTA CCTGCAGGAT CTGAGCGCCG CCGCCTCAGA GTGCATCGAC CCCTCGGTGG 5100 

TCTTCCCCTA CCCTCTCAAC GACAGCAGCT CGCCCAAGTC CTGCGCCTCG CAAGACTCCA 5160 

GCGCCTTCTC TCCGTCCTCG GATTCTCTGC TCTCCTCGAC GGAGTCCTCC CCGCAGGGCA 5220 

GCCCCGAGCC CCTGGTGCTC CATGAGGAGA CACCGCCCAC CACCAGCAGC GACTCTGGTA 5280 

AGCGAAGCCC GCCCAGGCCT GTCAAAAGTG GGCGGCTGGA TACCTTTCCC ATTTTCATTG 5340 

GCAGCTTATT TAACGGGCCA CTCTTATTAG GAAGGAGAGA TAGCAGATCT GGAGAGATTT 5400 

GGGAGCTCAT CACCTCTGAA ACCTTGGGCT TTAGCGTTTC CTCCCATCCC TTCCCCTTAG 54 60 

ACTGCCCATG TTTGCAGCCC CCCTCCCCGT TTGTCTCCCA CCCCTCAGGA ATTTCATTTA 5520 

GGTTTTTAAA CCTTCTGGCT TATCTTACAA CTCAATCCAC TTCTTCTTAC CTCCCGTTAA 5580 

CATTTTAATT GCCCTGGGGC GGGGTGGCAG GGAGTGTATG AATGAGGATA AGAGAGGATT 564 0 

GATCTCTGAG AGTGAATGAA TTGCTTCCCT CTTAACTTCC GAGAAGTGGT GGGATTTAAT 5700 

GAACTATCTA CAAAAATGAG GGGCTGTGTT TAGAGGCTAG GCAGGGCCTG CCTGAGTGCG 5760 

GGAGCCAGTG AACTGCCTCA AGAGTGGGTG GGCTGAGGAG CTGGGATCTT CTCAGCCTAT 5 82 0 

TTTGAACACT GAAAAGCAAA TCCTTGCCAA AGTTGGACTT TTTTTTTTCT TTTATTCCTT 5880 

CCCCCGCCCT CTTGGACTTT TGGCAAAACT GCAATTTTTT TTTTTTTATT TTTCATTTCC 594 0 

AGTAAAATAG GGAGTTGCTA AAGTCATACC AAGCAATTTG C AG CTATC AT TTGCAACACC 60 00 

TGAAGTGTTC TTGGTAAAGT CCCTCAAAAA TAGGAGGTGC TTGGGAATGT GCTTTGCTTT 606 0 

GGGTGTGTCC AAAGCCTCAT TAAGTCTTAG GTAAGAATTG GCATCAATGT CCTATCCTGG 6120 

GAAGTTGCAC TTTTCTTGTC CATGCCATAA CCCAGCTGTC TTTCCCTTTA TGAGACTCTT 6180 

ACCTTCATGG TGAGAGGAGT AAGGGTGGCT GGCTAGATTG GTTCTTTTTT TTTTTTTTTC 6 24 0 

CTTTTTTAAG ACGGAGTCTC ACTCTGTCAC TAGGCTGGAG TGCAGTGGCG CAATCAACCT 6300 

CCAACCCCCT GGTTCAAGAG ATTCTCCTGC CTCAGCCTCC CAAGTAGCTG GGACTACAGG 6360 

TGCACACCAC CATGCCAGGC TAATTTTTGT AATTTTAGTA GAGATGGGGT TTCATCGTGT 6420 

TGGCCAGGAT GGTCTCTCCT GACCTCACGA TCCGCCCACC TCGGCCTCCC AAAGTGCTGG 6480 

GATTACAGGT GTGAGCCAGG GCACCAGGCT TAGATGTGGC TCTTTGGGGA GATAATTTTG 6 54 0 

TCCAGAGACC TTTCTAACGT ATTCATGCCT TGTATTTGTA CAGCATTAAT CTGGTAATTG 6600 

ATTATTTTAA TGTAACCTTG CTAAAGGAGT GATTTCTATT TCCTTTCTTA AAGAGGAGGA 6660 

ACAAGAAGAT GAGGAAGAAA TCGATGTTGT TTCTGTGGAA AAGAGGCAGG CTCCTGGCAA 672 0 

AAGGTCAGAG TCTGGATCAC CTTCTGCTGG AGGCCACAGC AAACCTCCTC ACAGCCCACT 6780 

GGTCCTCAAG AGGTGCCACG TCTCCACACA TCAGCACAAC TACGCAGCGC CTCCCTCCAC 684 0 
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TCGGAAGGAC TATCCTGCTG CCAAGAGGGT CAAGTTGGAC AGTGTCAGAG TCCTGAGACA 6 900 

GATCAGCAAC AACCGAAAAT GCACCAGCCC CAGGTCCTCG GACACCGAGG AGAATGTCAA 6 960 

GAGGCGAACA CACAACGTCT TGGAGCGCCA GAGGAGGAAC GAGCTAAAAC GGAGCTTTTT 7020 

TGCCCTGCGT GACCAGATCC CGGAGTTGGA AAACAATGAA AAGGCCCCCA AGGTAGTTAT 7080 

CCTTAAAAAA GCCACAGCAT ACATCCTGTC CGTCCAAGCA GAGGAGCAAA AGCTCATTTC 7140 

TGAAGAGGAC TTGTTGCGGA AACGACGAGA ACAGTTGAAA CACAAACTTG AACAGCTACG 72 00 

GAACTCTTGT GCGTAAGGAA AAGTAAGGAA AACGATTCCT TCTAACAGAA ATGTCCTGAG 7260 

CAATCACCTA TGAACTTGTT TCAAATGCAT GATCAAATGC AAC CTCACAA CCTTGGCTGA 73 2 0 

GTC TTG AG AC TGAAAGATTT AG CCATAATG TAAACTGCCT CAAATTGGAC TTTGGGCATA 7380 

AAAGAACTTT TTTATGCTTA CCATCTTTTT TTTTTCTTTA ACAGATTTGT ATTTAAGAAT 744 0 

TGTTTTTAAA AAATTTTAAG ATTTACACAA TGTTTCTCTG TAAATATTGC CATTAAATGT 7500 

AAATAACTTT AATAAAACGT TTATAGCAGT TACACAGAAT TTCAATCCTA GTATATAGTA 7 560 

CCTAGTATTA TAGGTACTAT AAACCCTAAT TTTTTTTATT TAAGTACATT TTGCTTTTTA 762 0 

AAGTTGATTT TTTTCTATTG TTTTTAGAAA AAATAAAATA ACTGGCAAAT ATATCATTGA 768 0 

GCCAAATCTT AAGTTGTGAA TGTTTTGTTT CGTTTCTTCC CCCTCCCAAC CACCACCATC 774 0 

CCTGTTTGTT TTCATCAATT GCCCCTTCAG AGGGCGGTCT TAAGAAAGGC AAGAGTTTTC 7800 

t 

CTCTGTTGAA ATGGGTCTGG GGGCCTTAAG GTCTTTAAGT TCTTGGAGGT TCTAAGATGC 786 0 

TTCCTGGAGA CTATGATAAC AGCCAGAGTT GACAGTTAGA AGGAATGGCA GAAGGCAGGT 792 0 

GAGAAGGTGA GAGGTAGGCA AAGGAGATAC AAGAGGTCAA AGGTAGCAGT TAAGTACACA 7 98 0 

AAGAGGCATA AGGACTGGGG AGTTGGGAGG AAGGTGAGGA AGAAACTCCT GTTACTTTAG 8 04 0 

TTAACCAGTG CCAGTCCCCT GCTCACTCCA AACCCAGGAA TT 8 08 2 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4480 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

AGGGTTACAC GTCTTAACTC AGAGTTGCAA CAGG CTTGAA CAAGCCCAGG CACGCCCAGA 60 

TACCTAGGGC CGAGTCACCG TTAAAACTAA CAGACCATAA AAGGAAAGGA ATACAGAACA 120 

GACTAGGAGT ACCGGATCTG ACTCACAGGC CACCTGGCAG GAAGAGATAA GCCCCAGCCC 180 

CCGACATTCA GGACGTCCCA GCCCGCACGT ACTCTTACCA TGTTACAACC TCATTCGAAT 24 0 

ATGATTCAAA CCTGCCAATG TGTGTAGCTA TACCTTATCA CCTCATCTTG TGAAATAACC 300 

AATCATATGT GAACATGTCT ATATGCTTCG TTTAAATC C A CCAATCCCCG TAACTATGCA 360 
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TCTGCTTCTG TACGCCCGCT TCTGCTTCCC CAAACCCTAT AAAAGCC CCA TGCT AG AG CT 4 20 

GTTGGGCGCG CAAGTCCTCC GAAGAGACTG TGTGCCCGCA GGTACCTGTG TTTTCCAATA 4 80 

AACCCTCTTG CTGATTGCAT CCGAGTGGCC TCGGCTCGGT CATTGGGCGC TTGGGGGTCT 54 0 

CCTCCTGAGG GAAAGGTCCT CTCCGGAGGT CTTTTCATTT TGGGGGCTCG TCCGGGATCT 600 

GGAGATCCTC CGCCCAGAGA TCACCGACCA CCCACCGGGA GGTAAGCCGG CCGGCATCTG 66 0 

TCGTGTCTTG CCCTGTCTTG TCTTGTCTTG TCCTGTGCGC GTGTTCAGTT CGTCTCAGTT 72 0 

TTGGACTCAG ATCTGGGTTT TGGTCGAAGG AGAAGGCCCA GGGCTTCGGT TTCTCAGGGT 78 0 

TCAGGACCCT CAGCGCCTCC GTTTGGGCGG GTCAGAGAAG GAGCTGACGA GCTCGGACTT 84 0 

CTCCCCCCGC AGCCCTGGAA GACGTTCCAA GGGTGTCTGG AGCCCGGTTC TTTGGGGCTC 900 

AGCCCGTATC GGAGGGATAC GTGGTTTTGG TTGGAGGAGA GGGTCCAGGA CCCTCGGCAC 960 

CTCCATCTGA CTCTTTGTTT TGGGTTTTAC GTCGAAGCCG CGCGGCGCGT CTGTCTGTTA 102 0 

TTTGTCTGAT CGTTGGATTT GTCTGTCTAA TCTGTGCCCT AATTTTCTTT GAAGCTACCA 1080 

TGGGACAATC GCTAACAACC CCCTTGAGTC TCACTCTAGA CCATTGGAAG GACGTCCGAG 114 0 

ACCGAG CACG TGATCAGTCG GTCGAGATCA AGAAAGGTCC TCTCCGGAGG TCGGGGACAG 1200 

TCGCGCCAGC AAGCGGTGGG GCAGGAGCTC CTGGTTTGGC AGCCCCTGTA GAAGCGATGA 1260 

CAGAATACAA GCTTGTGGTG GTGGGCGCTA GAGGCGTGGG AAAGAGTGCC CTGACCATCC 132 0 

AGCTGATCCA G AAC CATTTT GTGGACGAGT ATGATCCCAC TATAGAGGAC TCCTACCGGA 138 0 

AACAGGTAGT CATTGATGGG GAGACGTGTT TACTGGACAT CTTAGACACA GCAGGTCAAG 1440 

AAGAGTATAG TGCCATGCGG GACCAGTACA TGCGCACAGG GGAGGGCTTC CTCTGTGTAT 1500 

TTGCCATCAA CAACACCAAG TCCTTTGAAG ACATCCATCA GTACAGGGAG CAGATCAAGC 156 0 

GGGTGAAAGA TTCAGATGAT GTGCCAATGG TGCTGGTGGG CAACAAGTGT GACCTGGCCG 162 0 

CTCACACTGT TGAGTCTCGG CAGGCCCAGG ACCTTGCTCG CAGCTATGGC ATCCCCTACA 168 0 

TTGAAACATC AGCCAAGACC CGACCAGGTG TGGAGGATGC CTTCTACACA CTAGTACGTG 174 0 

AGATTCGGCA GCATAAACTG CGGAAACTGA ACCCGCCTGA TGAGAGTGGC CCTGGCTGCA 1800 

TGAGCTGCAA GTGTGTGCTG TCCTGACACC AGGTTAAGGA CCTGATTTTC CGCCAGAAGC I860 

CGTACGGACA CCCTGACCAG GTGGCCTACA TTGTCACCTG GGAGAGCTTG GCATTTAGCC 1920 

CTCCTCCTTG GGCAGAACCC TTTGTGGACC CGAATTGGCT TCCTGTTTCC CCTAAACCTG 198 0 

TTTCCCCGAG CCCACCTGAC CCTTTGGTTG CTTCTTCCTC TCTCTATCCT GCTCTAACTA 2040 

AGGAAGAATC TCCCAAAGTC CCTCCCCCGA AACCTGTCCT CCCAGAGGAC CCAAATTCCC 2100 

CCCTTATAGA TCTCCTGTTG GAAGAACCTC CTCCGTACCC TGTACCTACA GCCCCGCCAA 216 0 

GAGAAGAGGA AG TGGAGC CG CCTGCTAGAC CTCGACTCGA GGCGGCCCCT TCCCCTGTGG 2220 

CTGGAAGACT TCGGGGACGA CGCGAGGTGG CGCCAGACTC CACCTCCCAG GCCTTTCCGC 2280 
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TTAGACAAGG GGCTGGCGGC CAGATACAAT ACTGGCCATT CTCAGCGGCC GACATATATA 2340 

ACTGGAAACA ACACAACCCC CCCTTTTCTA AGGATCCGGT GGCTCTCACC AACCAGATAG 24 00 

AATCTGTCTT GCTTACCCAT CAGCCCACTT GGGATGATAT ACAGCAACTT TTACAGGCCC 2460 

TCCTGACCTC TGAAGAGAAG CAGAGAGTGC TCTTAGAGGC CAGGAAACAT GTTTTGGGGG 2520 

ACAATGGACG CCCCACCTTG CTCCCGAAAG AGATCGATGA TGCATTCCCA CTTACAAGAC 2 580 

CTGATTGGGA TTTCACCACG GCTAAAGGTA GGAGACACCT ACGCCTTTAT CGCCAGTTGC 2 64 0 

TCCTAGCGGG TCTCCGAGGG GCGGCACGAC GCCCCACCAA TTTGGCTCAG GTAAAACAAG 2700 

TGGTACAAGA GGCTGCGGAG ACTCCCTCAG CCTTCCTAGA GAGACTTAAG GAAGCTTATC 2 76 0 

GCATGTATAC CCCTTATGAT CCAGATGATC CAGGACAAAT GACAAATGTC TCCATGTCCT 2 82 0 

TCATCTGGCA GGCAGCACCA GATATCAGGG CCAAGCTACA GAGAATAGAA AATTTACAAG 2880 

GGTATACACT GCAGGATTTA CTTAAGGAGG CAGAAAGAAT TTATAACAAG AGAGAGACAC 2 94 0 

AAGAAGAAAA GAAAGATAAA ATACGTAGAG AAAAAGATGA GAGAGACCGA AAAAGAAACA 3000 

GAGAGTTGAG TCGAATCTTG GCCGCCGTAG TTCAGGGTCA AGAGAAAAGG GGAGAGAGGG 3060 

TGGGAGTTCG AAAGGGGCCA AAGCTAGATA AGGATCAATG TGCGTATTGC AAAGAAAGAG 312 0 

GACACTGGGC CAGAGATTGC CCTAAGAAAC CCAGCGGCTC CGAAGACCCC GCCCACAGAC 3180 

CTCCCTCTTG GCCCTAGATA AAGATTAGGG AGGTCAGGGC CAGGAGCCCC CCCCTGAGCC 324 0 

CAGGATAACT CTTGAAGTTG GGGGGCAGCC AGTCACCTTT CTGGTGGACA CAGGAGCCCA 3300 

GCACTCAGTC CTCACCCAGG CCCCTGGACA ACTCAGCGAC CGGACGGCCT GGGTACAAGG 3360 

AGCCACTGGC AGCAAGAGAT ACCGTTGGAC TACAGATCGA CGGGTTCAGC TGGCTACTGG 34 20 

TAAGGTGACC CATTCCTTCT TACATGTTCC GGACTGCCCA TACCCTCTGC TGGGCCGTGA 34 80 

CTTGCTTACC AAATTAAAAG CTCAGATCCA TTTTGAAGAA GGAGGGACCC GAGTAACCGG 354 0 

GCCCCGCGGT ATTCCTCTTC AGATTTTAAC CCTTCAGTTA GAAGATGAAT ATAGATTATA 36 00 

TGAACCAGAA CAGGACAAGC CAAAATCTCC AGAAATAGAC TCTTGGGTCA CGAAATTCCC 36 6 0 

ACTGGCCTGG GCAGAGACTG GCGGGATGGG GTTGGCGCTC CAACAGCCTC CCCTAATTAT 3720 

CCAGTTAAAG GCCACCGCGA CTCCTGTCTC CATTAAACAG TACCCCATGT CATGGGAAGC 3 780 

TTATCAGGGC ATAAAGCCAC ATATCAGGAG GCTCTTAGAC CAAGGCATCC TAGTCCCTTG 384 0 

CCGGTCACCC TGGAATACGC CTCTGCTACC TGTTAAGAAG CCCGGCACTG GAGACTATAG 3900 

GCCAGTACAA GATTTGAGAG AGGTCAACAA AAGAGTAGAA GATATTCATC CAACTGTCCC 396 0 

AAACCCTTAT AACCTACTCA GCACCCTGCC TCCCACCCAT ACTTGGTATA CGGTCTTAGA 4020 

TCTGAAGGAT GCTTTCTTCT GCCTCCGGCT GAGCCCAGAA AGCCAGCCCT TATTTGCTTT 4 08 0 

TGAGTGGAAA GACTCTGAAA TGGGGCTTTC GGGACAGTTG ACTTGGACAA GGTTACCACA 414 0 

GGGTTTCAAA AACAGCCCAA CGCTCTTTGA TGAGGCCTTA CACCGGGACT TGGCTGACTT 420 0 
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TCGAGTCCAG CATCCCACTC TTATACTTCT TCAGTTTGTT GATGACCTTC TTCTAGGGGC 4260 

CACTTCTGAG ACAGCATGCC ACCAGGGAAC AGAATCCCTC TTGCAGACTT TGGGGCGATT 4 320 

GGGCTATCGA GCTTCTGCCA GAAAGGCTCA AATTTGCCAG ACCCAGGTTA CTTATTTAGG 4 380 

CTATCAACTA AGGGATGGAC AGCGATGGCT GACTCCGGCT AGGAAACAGA CCGTGGCCAA 4440 

CATCCCAGCC CCAAGAAATG GCCGACAGCT ACGGGAATTC 44 80 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 565 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GCTGAGTAGT GCGCGAGCAA AATTTAAGCT ACAACAAGGC AAGGCTTGGC CGACAATTGC 60 

ATGAAGAATC TGCTTAGGGT TAGGCGTTTT GCGCTGCTTC GCGATGTACG GGCCAGATAT 120 

ACGCGTATCT GAGGGGACTA GGGTGTGTTT AGGCGAAAAG CGGGGCTTCG GTTGTACGCG 180 

GTTAGGAGTC CCCTCAGGAT ATAGTAGTTT CGCTTTTGCA TAGGGAAGGG GAAATGTAGT 24 0 

CTTATGCAAT ACTCTTGTAG TCTTGCAACA TGCTTATGTA ACGATGAGTT AGCAACATGC 300 

CTTACAAGGA G AG AAAAAG C ACCGTGCATG CCGATTGGTG GAAGTAAGGT GGTACGATCG 360 

TGCCTTATTA GGAAGGCAAC AGACGGGTCT GACATGGATT GGACGAACCA CCGAATTCCG 4 20 

CATTGCAGAG ATATTGTATT TAAGTGCCTA GCTCGATACA ATAAACGCCA TTTGACCATT 480 
CACCACATTG GTGTGCACCT GGGTTGATGG CCGGACCGTT GATTCCCTGA CGACTACGAG 
CACCTGCATG AAGCAGAAGG CTTCA 

■ 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1804 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GGATCCTCAG GGGTAACACC TTTTGGAGGT GGGCATCTTC CTCATTCTCA GTGGTGCCAA 
GTTCATATCC TGCTGGCTTA ACACGTGGTG TTACTATATT TGTGGCCTTA TATGATTATG 
AAGCTAGAAC TACAGAAGAC CTTTCATTTA AGAAGGGTGA AAAATTTCAA ATAATTAACA 
ATACAGAAGG AGACTGGTGG GAAGCAAGAT CAATCACTAC AGGAAAGAAT GGTTATATCC 
TGAGCAGTTA TGTAGCGCCT GCAGATTCCA TTCAGGCAGA AGAATGGTAT TTTGGCAAAA 
TGGGGAGAAA AGATGCTGAA AGATTACTTC TGAATCCTGG AAATTAATGA GGTATTTTCT 
TAGGAAGAGA GAGTGAAATG GCTGGGTGCA GTGGCTCATG CCTGTAATCC CAGCACTTTG 



540 
565 



60 
120 
180 
240 
300 
360 
420 
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GGAGGCCGAG TTGGGCGGAT CACCTGAGGT CAGGAGTTCG AGACTAGCCT GGCCAACATG 48 0 

GTGAAACCCC ATCTCTACTA AAAAAAAAAG TACAAAATTA GCTGGACGTG GTGGTGAGTG 54 0 

CCTGTAATCC CAGCTACTCA GGAGGCTGAG GCAGCAGAAT CACTTGAACC TGGGAGGCGG 6 00 

AGG TTGCAGT GAGCTGAGAT CGCGCCACTG CACTCCAGCC TCGGCGACAA GAGCAAAAAC 66 0 

TCCGTCTAAA AAACAAATAA GCAAACAGAA CAAAACAAAA CAAAAACGAG AGAGCGAAAC 720 

TACTAAAGGT GCTTATTCCC TCTCTATTCG TGATTGGGAT GAGGTAAGGG GTGACAATGT 780 

GAAACACCAC AAAATTAGGA AACTTGACAA TGGTAGATAC TATATCACAA CCAGAGAACA 840 

ACTTGATACT CTGCAGAAAT TGGCAAAACA CTACACAGAA CATGCTGATG GTTTATGCCA 900 

CAAGTTAACA ACTGTGTGTC CAACTGTGAA ACCTCAGATT CAAGGTCTAG CAAAAGATGC 96 0 

TTGGGAAATC CCTTGATAAT CTTTGCGACT AGAGGTTAAA CTAGGACAAG GATGTTTTGG 1020 

CAAAGTGTGG ATGGGAATAT GGAATGGAAC CACAAAAGTA GCAATCAAAA CACTAAAACC 10 8 0 

AGGTACAATG ATGCCAGAAG CTTTTCTTCA AGAAGCTCAG GTAATGAAAA AAATAAGACA 114 0 

TGGTAAACTT GTTCCACTAT ATGCTGTTGT TTCTGAAGAG CCAATTTACA TTGTCACTGA 12 00 

ATTGATGTCA AAAGGAAGCT TATTCAATTT CCTTAAGGAA GGAGATGGAA AGTATTTGAA 12 60 

GCTTCCACAA ATGGTTGATA TGCCTGCTCA GATTGCTGAT GGTATGGCAT ATATTAAAAG 1320 

AATGAACTAT ATTCACCGAG ATCTCTGGGC TGCTAATATT CTTGTAGGAG AAAATCTTCT 13 80 

GTGCAAAATA GCAGATTTTG GTTTAGCAAG GTTAATTGAA GACAATGAAT ACACATCAAG 144 0 

ACAAGGTGCA GAATTTCCAA TCAAATGGAC AGCTCCTGAA GTTGCACTGT ATGGTGGGTT 1500 

TACAATAAAG TCTGGTGTCT GCTCATTTGG AATTCTACAG ACAGAACTGG TAACAAAGGG 156 0 

CAGAGTGCCA TATCCAGGTA TGGTGAACCA TGAAATACTG GAACAGGTGG AGCGAGGATA 16 2 0 

CAGGATGCCT TGCCCTCAGG GCTGTCCAGA ATCCCTCCAT GAATTGATGA ATCTGTGTTG 16 8 0 

GAAGAAGGAC CCTGATGAAA GACCAACATT TGAATATGTT CAGTCCTTCT TGGGAGACTA 1740 

CTTCACTGCT ACAGAGCCAT AGTACCAGCC AGGAGAAAAC TTCTAATTCA AGTAGCCTAT 1800 

TTTA 18 04 
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Claims 

1 . A cellular immunogen for immunizing a host against the 
effects of the product of a target proto-oncogene, the overexpression of which 
target proto-oncogene is associated with a cancer, which cellular immunogen 
comprises host cells which have been transfected with at least one transgene 
construct comprising at least one transgene cognate to the target proto-oncogene 
and a strong promoter to drive the expression of the transgene in the transfected 
cells, the transgene encoding a gene product which induces host 
immunoreactivity to host self-determinants of the product of the target proto- 
oncogene gene. 

2. An immunogen according to claim 1 wherein the transgene 

comprises 

wild-type or mutant retroviral oncogene DNA; or 
wild-type or mutant proto-oncogene DNA of a species 
different from the host species. 

3. An immunogen according to claim 2 wherein the transfected 
cells are non-dividing. 

4. An immunogen according to claim 2 wherein the transgene 
comprises mutant retroviral oncogene DNA or mutant proto-oncogene DNA. 

5. An immunogen according to claim 4 wherein the mutant DNA 
is nontransforming. 



6. An immunogen according to claim 5 wherein the mutant DNA 
comprises a deletion mutation in a region of said DNA which is essential for 
transformation. 
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7. A cellular immunogen according to claim 6 wherein the host 
cells have been transfected with a plurality of transgene constructs, each 
construct encoding a different deletion mutation. 

8. An immunogen according to claim 1 wherein the host cells 
have been transfected with a transgene cognate to a target proto-oncogene 
selected from the group of proto-oncogenes consisting of AKT-2, c-erbB-2, 
MDM-2, c-myc, c-myb, c-ras, c-src and c-yes. 

9. An immunogen according to claim 1 wherein the cells 
comprise fibroblasts. 

10. A method for preparing a cellular immunogen for 
immunizing a host against the effects of the product of a target proto-oncogene, 
the overexpression of which target proto-oncogene is associated with a cancer, 
the method comprising: 

(a) excising cells from the host; 

(b) transfecting the excised cells with at 
least one transgene construct comprising at least 
one transgene cognate to the target proto-oncogene 
and a strong promoter to drive the expression of 
the transgene in the transfected cells, the 
transgene encoding a gene product which induces 
host immunoreactivity to host self-determinants of 
the product of the target proto-oncogene gene. 



comprises 



11. A method according to claim 11 wherein the transgene 

wild-type or mutant retroviral oncogene DNA; or 
wild-type or mutant proto-oncogene DNA of a species 
different from the host species. 
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12. A method according to claim 1 1 wherein the transfected cells 
are non-dividing. 

13. A method according to claim 11 wherein the transgene 
comprises mutant retroviral oncogene DNA or mutant proto-oncogene DNA. 

14. A method according to claim 13 wherein the mutant DNA 
is nontransforming. 

15. A method according to claim 14 wherein the mutant DNA 
comprises a deletion mutation in a region of said DNA which is essential for 
transformation. 



16. A method according to claim 15 wherein the host cells are 
transfected with a plurality of transgene constructs, each construct encoding a 
different deletion mutation. 



17. A method according to claim 11 wherein the transgene is 
cognate to a target proto-oncogene selected from the group of proto-oncogenes 
consisting of AKT-2, c-^5-2, MDM-2, z-myc< c-myb, c-ras, c-src and c-m. 

18. A method according to claim 1 wherein the excised cells 
comprise fibroblasts. 



19. A method of vaccinating a host against disease associated 
with the overexpression of a target proto-oncogene comprising 

(a) excising cells from the host; 

(b) transfecting the excised cells with at 
least one transgene construct comprising at least 
one transgene cognate to the target proto-oncogene 
and a strong promoter to drive the expression of 
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the transgene in the transfected cells, the 
transgene encoding a gene product which induces 
host immunoreactivity to host self-determinants of 
the product of the target proto-oncogene gene; 

(c) returning the excised cells transfected 
with the transgene construct to the body of the 
host to obtain expression of the transgene in the 
host. 



20. A method according to claim 19 wherein the transgene 

comprises 

wild-type or mutant retroviral oncogene DNA; or 
wild-type or mutant proto-oncogene DNA of a species 
different from the host species. 

21 . A method according to claim 20 wherein the transfected cells 
are rendered non-dividing prior to return to the body of the host. 

22. A method according to claim 20 wherein the transgene 
comprises mutant retroviral oncogene DNA or mutant proto-oncogene DNA. 

23. A method according to claim 22 wherein the mutant DNA 
is nontransforming. 

24. A method according to claim 23 wherein the mutant DNA 
comprises a deletion mutation in a region of said DNA which is essential for 
transformation. 



25. A method according to claim 24 wherein the host cells are 
transfected with a plurality of transgene constructs, each construct encoding a 
different deletion mutation. 
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26. A method according to claim 19 wherein the transgene is 
cognate to a target proto-oncogene selected from the group of proto-oncogenes 
consisting of AKT-2, c-erbB-2, MDM-2, c-rnyc, c-myb, c-ras, c-src and c-yes. 

27. A method according to claim 19 wherein the excised host 
cells comprise fibroblasts. 



28. A method of vaccinating a host against disease associated 
with the overexpression of a targeted proto-oncogene comprising 

(a) excising cells from the host; 

(b) transfecting the excised cells with at 
least one transgene construct comprising at least 
transgene and a strong promoter to drive the 
expression of the transgene in the transfected 
cells, wherein the transgene comprises 

(1) wild-type or mutant cognate retroviral 
oncogene DNA; or 

(2) wild-type or mutant cognate proto- 
oncogene DNA of a species different from the 
host species; 

(c) returning the excised cells transfected 
with the transgene construct to the body of the 
host to obtain expression of the transgene in the 
host. 
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